Estimating the proportion of true null hypotheses using the pattern of observed p-values期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Estimating the proportion of true null hypotheses using the pattern of observed p-values

Authors:	Tiejun Tong Zeny Feng Julia S. Hilton Hongyu Zhao

Affiliation:	1. Department of Mathematics , Hong Kong Baptist University , Kowloon Tong, Hong Kong , People's Republic of China;2. Institute of Computational and Theoretical Studies, Hong Kong Baptist University , Kowloon Tong, Hong Kong , People's Republic of China;3. Department of Mathematics and Statistics , University of Guelph , Guelph , Canada;4. Department of Applied Mathematics , University of Colorado , Boulder , CO , 80309 , USA;5. Department of Biostatistics , Yale School of Public Health, Yale University , New Haven , CT , USA

Abstract:	Estimating the proportion of true null hypotheses, π₀, has attracted much attention in the recent statistical literature. Besides its apparent relevance for a set of specific scientific hypotheses, an accurate estimate of this parameter is key for many multiple testing procedures. Most existing methods for estimating π₀ in the literature are motivated from the independence assumption of test statistics, which is often not true in reality. Simulations indicate that most existing estimators in the presence of the dependence among test statistics can be poor, mainly due to the increase of variation in these estimators. In this paper, we propose several data-driven methods for estimating π₀ by incorporating the distribution pattern of the observed p-values as a practical approach to address potential dependence among test statistics. Specifically, we use a linear fit to give a data-driven estimate for the proportion of true-null p-values in (λ, 1] over the whole range [0, 1] instead of using the expected proportion at 1?λ. We find that the proposed estimators may substantially decrease the variance of the estimated true null proportion and thus improve the overall performance.

Keywords:	gene expression data multiple testing proportion of true null hypotheses p-value

设为首页 | 免责声明 | 关于勤云 | 加入收藏