首页 | 本学科首页   官方微博 | 高级检索  
     


A robust variable screening method for high-dimensional data
Authors:Tao Wang  Lin Zheng  Haiyang Liu
Affiliation:1. School of Mathematical Sciences, Nankai University, Tianjin City, People's Republic of China;2. School of Mathematics and Statistics, Kashgar University, Kashgar City, People's Republic of China;3. Department of Aviation Material Management, Air Force Logistics College, Xuzhou City, People's Republic of China
Abstract:In practice, the presence of influential observations may lead to misleading results in variable screening problems. We, therefore, propose a robust variable screening procedure for high-dimensional data analysis in this paper. Our method consists of two steps. The first step is to define a new high-dimensional influence measure and propose a novel influence diagnostic procedure to remove those unusual observations. The second step is to utilize the sure independence screening procedure based on distance correlation to select important variables in high-dimensional regression analysis. The new influence measure and diagnostic procedure that we developed are model free. To confirm the effectiveness of the proposed method, we conduct simulation studies and a real-life data analysis to illustrate the merits of the proposed approach over some competing methods. Both the simulation results and the real-life data analysis demonstrate that the proposed method can greatly control the adverse effect after detecting and removing those unusual observations, and performs better than the competing methods.
Keywords:High-dimensional data analysis  variable screening  influential observation  distance correlation  bootstrap
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号