Procedures for the identification of multiple influential observations in linear regression |
| |
Authors: | A.A.M. Nurunnabi, Ali S. Hadi, A.H.M.R. Imon |
| |
Affiliation: | 1. SLG, Department of Statistics, Rajshahi University, Rajshahi, Bangladesh; 2. Department of Mathematics and Actuarial Science, American University in Cairo, New Cairo, Egypt; 3. Department of Mathematical Sciences, Ball State University, Muncie, IN, USA |
| |
Abstract: | Since the seminal paper by Cook (1977), in which he introduced Cook's distance, the identification of influential observations in linear regression has received a great deal of interest and extensive investigation. It is well documented that most of the popular diagnostic measures based on single-case deletion can mislead the analysis in the presence of multiple influential observations because of the well-known masking and/or swamping phenomena. Atkinson (1981) proposed a modification of Cook's distance. In this paper we propose a further modification of Cook's distance for the identification of a single influential observation. We then propose new measures for the identification of multiple influential observations, which are not affected by the masking and swamping problems. The efficiency of the new statistics is illustrated through several well-known data sets and a simulation study. |
| |
Keywords: | BACON; Cook's distance; least median of squares; least trimmed squares; leverage values; masking; outliers; regression diagnostics; swamping |
|
|