首页 | 本学科首页   官方微博 | 高级检索  
     


An extended sweep operator for the cross validation of variable selection in linear regression
Abstract:In its application to variable selection in the linear model, cross-validation is traditionally applied to an individual model contained in a set of potential models. Each model in the set is cross-validated independently of the rest and the model with the smallest cross-validated sum of squares is selected. In such settings, an efficient algorithm for cross-validation must be able to add and to delete single points quickly from a mixed model. Recent work in variable selection has applied cross-validation to an entire process of variable selection, such as Backward Elimination or Stepwise regression (Thall, Simon and Grier, 1992). The cross-validated version of Backward Elimination, for example, divides the data into an estimation and validation set and performs a complete Backward Elimination on the estimation set, while computing the cross-validated sum of squares at each step with the validation set. After doing this process once, a different validation set is selected and the process is repeated. The final model selection is based on the cross-validated sum of squares for all Backward Eliminations. An optimal algorithm for this application of cross-validation need not be efficient in adding and deleting observations from a single model but must be efficient in computing the cross-validation sum of squares from a series of models using a common validation set. This paper explores such an algorithm based on the sweep operator.
Keywords:Sweep  Cross-validation  Model Selection
相似文献(共20条):
[1]、A variable selection proposal for multiple linear regression analysis[J].Journal of Statistical Computation and Simulation
[2]、Lennart Nordberg.On variable selection in generalized linear and related regression models[J].统计学通讯:理论与方法,2013,42(21):2427-2449.
[3]、Sung-Soo Kim ,Sung H. Park ,W. J. Krzanowski.Simultaneous variable selection and outlier identification in linear regression using the mean-shift outlier model[J].Journal of applied statistics,2008,35(3):283-291.
[4]、Carmen D.S. André,Silvia N. Elian,Subhash C. Narula,Rodrigo A. Tavares.Coefficients of determinations for variable selection in the msae regression[J].统计学通讯:理论与方法,2013,42(3):623-642.
[5]、Simon L. Davies, Andrew A. Neath,Joseph E. Cavanaugh,.Cross validation model selection criteria for linear regression based on the Kullback–Leibler discrepancy[J].Statistical Methodology,2005,2(4):249-266.
[6]、Yazhao Lv,Weihua Zhao,Jicai Liu.Quantile regression and variable selection for the single-index model[J].Journal of applied statistics,2014,41(7):1565-1577.
[7]、Bruce E. Barrett,J. Brian Gray.A computational framework for variable selection in multivariate regression[J].Statistics and Computing,1994,4(3):203-212.
[8]、L. Yang,& R. Tschernig.Multivariate bandwidth selection for local linear regression[J].Journal of the Royal Statistical Society. Series B, Statistical methodology,1999,61(4):793-815.
[9]、Nicolas W. Hengartner, Marten H. Wegkamp, Eric Matzner-Løber.Bandwidth selection for local linear regression smoothers[J].Journal of the Royal Statistical Society. Series B, Statistical methodology,2002,64(4):791-804.
[10]、A stepwise regression algorithm for high-dimensional variable selection[J].Journal of Statistical Computation and Simulation
[11]、Ailing Yan.Adaptive elastic net-penalized quantile regression for variable selection[J].统计学通讯:理论与方法,2013,42(20):5106-5120.
[12]、Howard D. Bondell, Lexin Li.Shrinkage inverse regression estimation for model-free variable selection[J].Journal of the Royal Statistical Society. Series B, Statistical methodology,2009,71(1):287-299.
[13]、Zhixuan Fu,Chirag R. Parikh,Bingqing Zhou.Penalized variable selection in competing risks regression[J].Lifetime data analysis,2017,23(3):353-376.
[14]、Cross validation of ridge regression estimator in autocorrelated linear regression models[J].Journal of Statistical Computation and Simulation
[15]、Yang Yu,Zhihong Zou.Bayesian quantile regression and variable selection for partial linear single-index model: Using free knot spline[J].统计学通讯:模拟与计算,2019,48(5):1429-1449.
[16]、Ines Jlassi.Variable importance assessment in sliced inverse regression for variable selection[J].统计学通讯:模拟与计算,2019,48(1):169-199.
[17]、Sample size calculation for method validation using linear regression[J].Journal of Statistical Computation and Simulation
[18]、Peter J. Smith,Lalith W. Peiris.Added variable plots for linear regression with censored data[J].统计学通讯:理论与方法,2013,42(8):1987-2000.
[19]、Kjell Doksum,Derick Peterson,& Alex Samarov.On variable bandwidth selection in local polynomial regression[J].Journal of the Royal Statistical Society. Series B, Statistical methodology,2000,62(3):431-448.
[20]、S. Min.Bayesian variable selection in Poisson change-point regression analysis[J].统计学通讯:模拟与计算,2017,46(3):2267-2282.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号