Model selection in high-dimensional noisy data: a simulation study |
| |
Authors: | Giovanni Romeo Magne Thoresen |
| |
Affiliation: | Department of Biostatistics, Oslo Centre for Biostatistics and Epidemiology, University of Oslo, Oslo, Norway |
| |
Abstract: | In many practical applications, high-dimensional regression analyses have to take into account measurement error in the covariates. It is thus necessary to extend regularization methods, that can handle the situation where the number of covariates p largely exceed the sample size n, to the case in which covariates are also mismeasured. A variety of methods are available in this context, but many of them rely on knowledge about the measurement error and the structure of its covariance matrix. In this paper, we set the goal to compare some of these methods, focusing on situations relevant for practical applications. In particular, we will evaluate these methods in setups in which the measurement error distribution and dependence structure are not known and have to be estimated from data. Our focus is on variable selection, and the evaluation is based on extensive simulations. |
| |
Keywords: | Measurement error high-dimensional regression lasso matrix uncertainty selector convex conditional lasso variable selection |
|
|