Extended BIC for linear regression models with diverging number of relevant features and high or ultra-high feature spaces |
| |
Authors: | Shan Luo Zehua Chen |
| |
Affiliation: | Department of Statistics and Applied Probability, National University of Singapore, 3 Science Drive 2, Singapore 117543, Republic of Singapore |
| |
Abstract: | In many conventional scientific investigations with high or ultra-high dimensional feature spaces, the relevant features, though sparse, are large in number compared with classical statistical problems, and the magnitude of their effects tapers off. It is reasonable to model the number of relevant features as a diverging sequence when sample size increases. In this paper, we investigate the properties of the extended Bayes information criterion (EBIC) (Chen and Chen, 2008) for feature selection in linear regression models with diverging number of relevant features in high or ultra-high dimensional feature spaces. The selection consistency of the EBIC in this situation is established. The application of EBIC to feature selection is considered in a SCAD cum EBIC procedure. Simulation studies are conducted to demonstrate the performance of the SCAD cum EBIC procedure in finite sample cases. |
| |
Keywords: | Diverging number of parameters Feature selection Extended Bayes information criterion High dimensional feature space Penalized likelihood Selection consistency |
本文献已被 ScienceDirect 等数据库收录! |
|