Positive False Discovery Rate Estimate in Step-Wise Variable Selection |
| |
Authors: | Lang Li Siu Hui |
| |
Affiliation: | 1. Department of Medicine , Indiana University , Indianapolis, Indiana, USA lali@iupui.edu;3. Department of Medicine , Indiana University , Indianapolis, Indiana, USA |
| |
Abstract: | Selecting predictors to optimize the outcome prediction is an important statistical method. However, it usually ignores the false positives in the selected predictors. In this article, we advocate a conventional stepwise forward variable selection method based on the predicted residual sum of squares, and develop a positive false discovery rate (pFDR) estimate for the selected predictor subset, and a local pFDR estimate to prioritize the selected predictors. This pFDR estimate takes account of the existence of non null predictors, and is proved to be asymptotically conservative. In addition, we propose two views of a variable selection process: an overall and an individual test. An interesting feature of the overall test is that its power of selecting non null predictors increases with the proportion of non null predictors among all candidate predictors. Data analysis is illustrated with an example, in which genetic and clinical predictors were selected to predict the cholesterol level change after four months of tamoxifen treatment, and pFDR was estimated. Our method's performance is evaluated through statistical simulations. |
| |
Keywords: | Cross-validation False discovery rate Multiple-comparisons Pharmacogenetics Variable selection |
|
|