Overlapping group lasso for high-dimensional generalized linear models |
| |
Authors: | Shengbin Zhou Jingke Zhou Bo Zhang |
| |
Affiliation: | 1. Department of Statistics, Harbin Normal University, Harbin, Chinazhoushengbin1023@126.com;3. Department of Statistics, Harbin Normal University, Harbin, China |
| |
Abstract: | AbstractStructured sparsity has recently been a very popular technique to deal with the high-dimensional data. In this paper, we mainly focus on the theoretical problems for the overlapping group structure of generalized linear models (GLMs). Although the overlapping group lasso method for GLMs has been widely applied in some applications, the theoretical properties about it are still unknown. Under some general conditions, we presents the oracle inequalities for the estimation and prediction error of overlapping group Lasso method in the generalized linear model setting. Then, we apply these results to the so-called Logistic and Poisson regression models. It is shown that the results of the Lasso and group Lasso procedures for GLMs can be recovered by specifying the group structures in our proposed method. The effect of overlap and the performance of variable selection of our proposed method are both studied by numerical simulations. Finally, we apply our proposed method to two gene expression data sets: the p53 data and the lung cancer data. |
| |
Keywords: | Generalized linear models overlap sparsity Oracle inequalities high dimension |
|
|