首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Using the EM algorithm for Bayesian variable selection in logistic regression models with related covariates
Authors:M D Koslovsky  M D Swartz  L Leon-Novelo  W Chan  A V Wilkinson
Institution:1. Department of Biostatistics, UTHealth, Houston, TX, USA;2. Department of Epidemiology, UTHealth, Austin, TX, USA
Abstract:We develop a Bayesian variable selection method for logistic regression models that can simultaneously accommodate qualitative covariates and interaction terms under various heredity constraints. We use expectation-maximization variable selection (EMVS) with a deterministic annealing variant as the platform for our method, due to its proven flexibility and efficiency. We propose a variance adjustment of the priors for the coefficients of qualitative covariates, which controls false-positive rates, and a flexible parameterization for interaction terms, which accommodates user-specified heredity constraints. This method can handle all pairwise interaction terms as well as a subset of specific interactions. Using simulation, we show that this method selects associated covariates better than the grouped LASSO and the LASSO with heredity constraints in various exploratory research scenarios encountered in epidemiological studies. We apply our method to identify genetic and non-genetic risk factors associated with smoking experimentation in a cohort of Mexican-heritage adolescents.
Keywords:Bayesian inference  binary outcomes  deterministic annealing  expectation-maximization  grouped covariates  heredity constraint  inheritance property  variable selection
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号