首页 | 本学科首页   官方微博 | 高级检索  
     


Monte Carlo studies of bootstrap variability in ROC analysis with data dependency
Authors:Jin Chu Wu  Alvin F. Martin  Raghu N. Kacker
Affiliation:National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
Abstract:ROC analysis involving two large datasets is an important method for analyzing statistics of interest for decision making of a classifier in many disciplines. And data dependency due to multiple use of the same subjects exists ubiquitously in order to generate more samples because of limited resources. Hence, a two-layer data structure is constructed and the nonparametric two-sample two-layer bootstrap is employed to estimate standard errors of statistics of interest derived from two sets of data, such as a weighted sum of two probabilities. In this article, to reduce the bootstrap variance and ensure the accuracy of computation, Monte Carlo studies of bootstrap variability were carried out to determine the appropriate number of bootstrap replications in ROC analysis with data dependency. It is suggested that with a tolerance 0.02 of the coefficient of variation, 2,000 bootstrap replications be appropriate under such circumstances.
Keywords:Bootstrap variability  Bootstrap replications  ROC analysis  Data dependency  Large datasets  Standard error
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号