A multiple-case deletion approach for detecting influential points in high-dimensional regression |
| |
Authors: | Tao Wang Qun Li Qingpei Zang |
| |
Affiliation: | 1. School of Mathematical Sciences, Huaiyin Normal University, Huaian City, P. R. China;2. Institute of Statistics and LPMC, Nankai University, Tianjin City, P. R. China;3. Department of Physiology, University of Szeged, Szeged, Hungary |
| |
Abstract: | ABSTRACTIn high-dimensional regression, the presence of influential observations may lead to inaccurate analysis results so that it is a prime and important issue to detect these unusual points before statistical regression analysis. Most of the traditional approaches are, however, based on single-case diagnostics, and they may fail due to the presence of multiple influential observations that suffer from masking effects. In this paper, an adaptive multiple-case deletion approach is proposed for detecting multiple influential observations in the presence of masking effects in high-dimensional regression. The procedure contains two stages. Firstly, we propose a multiple-case deletion technique, and obtain an approximate clean subset of the data that is presumably free of influential observations. To enhance efficiency, in the second stage, we refine the detection rule. Monte Carlo simulation studies and a real-life data analysis investigate the effective performance of the proposed procedure. |
| |
Keywords: | High-dimensional regression Influential points Masking Regression diagnostics |
|
|