Composite quantile regression for massive datasets |
| |
Authors: | Rong Jiang Keming Yu Weimin Qian |
| |
Institution: | 1. Department of Applied Mathematics, College of Science, Donghua University, Shanghai, People's Republic of China;2. Department of Mathematics, Brunel University London, Middlesex, UK;3. School of Mathematical Sciences, Tongji University, Shanghai, People's Republic of China |
| |
Abstract: | Analysis of massive datasets is challenging owing to limitations of computer primary memory. Composite quantile regression (CQR) is a robust and efficient estimation method. In this paper, we extend CQR to massive datasets and propose a divide-and-conquer CQR method. The basic idea is to split the entire dataset into several blocks, applying the CQR method for data in each block, and finally combining these regression results via weighted average. The proposed approach significantly reduces the required amount of primary memory, and the resulting estimate will be as efficient as if the entire data set is analysed simultaneously. Moreover, to improve the efficiency of CQR, we propose a weighted CQR estimation approach. To achieve sparsity with high-dimensional covariates, we develop a variable selection procedure to select significant parametric components and prove the method possessing the oracle property. Both simulations and data analysis are conducted to illustrate the finite sample performance of the proposed methods. |
| |
Keywords: | Massive dataset divide and conquer composite quantile regression variable selection |
|
|