首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A multiple testing protocol for exploratory data analysis and the local misclassification rate
Authors:David D Watts
Institution:Department of Statistics, Oklahoma State University
Abstract:A false discovery rate (FDR) procedure is often employed in exploratory data analysis to determine which among thousands or millions of attributes are worthy of follow-up analysis. However, these methods tend to discover the most statistically significant attributes, which need not be the most worthy of further exploration. This article provides a new FDR-controlling method that allows for the nature of the exploratory analysis to be considered when determining which attributes are discovered. To illustrate, a study in which the objective is to classify discoveries into one of several clusters is considered, and a new FDR method that minimizes the misclassification rate is developed. It is shown analytically and with simulation that the proposed method performs better than competing methods.
Keywords:Classification  False discovery rate  Local false discovery rate  Local misclassification rate  Statistical significance  
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号