Handling co-dependence issues in resampling-based variable selection procedures: a simulation study
Authors: Riccardo De Bin, Willi Sauerbrei
Institution: 1. Department of Medical Informatics, Biometry and Epidemiology, Ludwig-Maximilians-Universität of Munich, Germany; 2. Department of Mathematics, University of Oslo, Oslo, Norway (debin@math.uio.no); 4. Faculty of Medicine and Medical Center, Institute for Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
Abstract: When a number of candidate variables are available, variable selection is a key task that aims to identify the candidates which influence the outcome of interest. Methods such as backward elimination and forward selection are often used despite their drawbacks. One of these drawbacks is the instability of their results with respect to small perturbations in the data. To handle this issue, resampling-based procedures have been introduced: using a resampling technique, e.g. the bootstrap, these procedures generate several pseudo-samples that are used to compute the inclusion frequency of each variable, i.e. the proportion of pseudo-samples in which the variable is selected. Based on the inclusion frequencies, it is possible to discriminate between relevant and irrelevant variables. These procedures may fail when variables are correlated, because correlated variables can substitute for one another across pseudo-samples. To deal with this issue, two procedures based on 2×2 tables of inclusion frequencies have been developed in the literature. In this paper we analyse the behaviours of these two procedures and the role of their tuning parameters in an extensive simulation study.
Keywords: correlation; inclusion frequencies; model building; bootstrap
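
The inclusion frequencies described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the simulated data, the simple backward elimination on OLS |t| statistics (threshold `t_crit`), the number of bootstrap samples, and the function names. The 2×2 table only cross-tabulates the joint selection of two variables; the decision rules and tuning parameters of the two procedures studied in the paper are not reproduced here.

```python
import numpy as np


def backward_elimination(X, y, t_crit=2.0):
    """Hypothetical selector: drop the weakest variable until all |t| >= t_crit."""
    n = X.shape[0]
    active = list(range(X.shape[1]))
    while active:
        Xa = np.column_stack([np.ones(n), X[:, active]])  # add intercept
        beta, *_ = np.linalg.lstsq(Xa, y, rcond=None)
        resid = y - Xa @ beta
        sigma2 = resid @ resid / (n - Xa.shape[1])
        cov = sigma2 * np.linalg.inv(Xa.T @ Xa)
        t = np.abs(beta[1:] / np.sqrt(np.diag(cov)[1:]))  # skip intercept
        worst = int(np.argmin(t))
        if t[worst] >= t_crit:
            break
        active.pop(worst)
    return active


def bootstrap_selection(X, y, n_boot=200, seed=0):
    """Boolean matrix: row b marks the variables selected in bootstrap sample b."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    selected = np.zeros((n_boot, p), dtype=bool)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample rows with replacement
        selected[b, backward_elimination(X[idx], y[idx])] = True
    return selected


def two_by_two(selected, j, k):
    """Joint inclusion counts for variables j and k (rows: j in/out, cols: k in/out)."""
    a, b = selected[:, j], selected[:, k]
    return np.array([[np.sum(a & b), np.sum(a & ~b)],
                     [np.sum(~a & b), np.sum(~a & ~b)]])


# Simulated data (assumption): x1 is correlated with x0, x2 is pure noise,
# and only x0 influences the outcome.
rng = np.random.default_rng(1)
n = 150
x0 = rng.normal(size=n)
x1 = 0.8 * x0 + 0.6 * rng.normal(size=n)
x2 = rng.normal(size=n)
X = np.column_stack([x0, x1, x2])
y = x0 + rng.normal(size=n)

selected = bootstrap_selection(X, y, n_boot=200)
frequencies = selected.mean(axis=0)  # inclusion frequency of each variable
table = two_by_two(selected, 0, 1)   # joint inclusion of the correlated pair

print("inclusion frequencies:", frequencies)
print("2x2 table for x0 and x1:")
print(table)
```

When two variables are strongly correlated, a large off-diagonal count in the table indicates that one tends to be selected when the other is not, which is the kind of co-dependence that marginal inclusion frequencies alone cannot show.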