首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Simultaneous variable selection and outlier identification in linear regression using the mean-shift outlier model
Authors:Sung-Soo Kim  Sung H Park  W J Krzanowski
Institution:  a Department of Information Statistics, Korea National Open University, Seoul, Korea; b Department of Statistics, Seoul National University, Seoul, Korea; c School of Engineering, Computer Science and Mathematics, University of Exeter, Exeter, UK
Abstract:We provide a method for simultaneous variable selection and outlier identification using the mean-shift outlier model. The procedure consists of two steps: the first step is to identify potential outliers, and the second step is to perform all possible subset regressions for the mean-shift outlier model containing the potential outliers identified in step 1. This procedure is helpful for model selection while simultaneously considering outlier identification, and can be used to identify multiple outliers. In addition, we can evaluate the impact on the regression model of simultaneous omission of variables and interesting observations. In an example, we provide detailed output from the R system, and compare the results with those using posterior model probabilities as proposed by Hoeting et al. Comput. Stat. Data Anal. 22 (1996), pp. 252-270] for simultaneous variable selection and outlier identification.
Keywords:multiple outliers  variable selection  mean-shift outlier model  all-subset regressions
本文献已被 InformaWorld 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号