首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Performance of asymmetric links and correction methods for imbalanced data in binary regression
Authors:Alex de la Cruz Huayanay  Jorge L Bazán  Vicente G Cancho  Dipak K Dey
Institution:1. Inter-Institutional Graduation in Statistics, USP/UFSCar, S?o Carlos, Brazil;2. Department of Applied Mathematics and Statistics, University of S?o Paulo, S?o Carlos, Brazil;3. Departmente of Statistics, University of Connecticut, Mansfield, CT, USA
Abstract:In binary regression, imbalanced data result from the presence of values equal to zero (or one) in a proportion that is significantly greater than the corresponding real values of one (or zero). In this work, we evaluate two methods developed to deal with imbalanced data and compare them to the use of asymmetric links. The results based on simulation study show, that correction methods do not adequately correct bias in the estimation of regression coefficients and that the models with power links and reverse power considered produce better results for certain types of imbalanced data. Additionally, we present an application for imbalanced data, identifying the best model among the various ones proposed. The parameters are estimated using a Bayesian approach, considering the Hamiltonian Monte-Carlo method, utilizing the No-U-Turn Sampler algorithm and the comparisons of models were developed using different criteria for model comparison, predictive evaluation and quantile residuals.
Keywords:Asymmetric link  binary regression  imbalanced data  predictive evaluation  quantile residuals  similarity measures
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号