首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Construction of disease risk scoring systems using logistic group lasso: application to porcine reproductive and respiratory syndrome survey data
Authors:Hui Lin  Peng Liu  Derald J Holtkamp
Institution:1. Department of Statistics , College of Liberal Arts and Sciences, Iowa State University , Ames , IA , 50011 , USA;2. Department of Veterinary Diagnostic and Production Animal Medicine , College of Veterinary Medicine, Iowa State University , Ames , IA , 50011 , USA
Abstract:We propose to utilize the group lasso algorithm for logistic regression to construct a risk scoring system for predicting disease in swine. This work is motivated by the need to develop a risk scoring system from survey data on risk factor for porcine reproductive and respiratory syndrome (PRRS), which is a major health, production and financial problem for swine producers in nearly every country. Group lasso provides an attractive solution to this research question because of its ability to achieve group variable selection and stabilize parameter estimates at the same time. We propose to choose the penalty parameter for group lasso through leave-one-out cross-validation, using the criterion of the area under the receiver operating characteristic curve. Survey data for 896 swine breeding herd sites in the USA and Canada completed between March 2005 and March 2009 are used to construct the risk scoring system for predicting PRRS outbreaks in swine. We show that our scoring system for PRRS significantly improves the current scoring system that is based on an expert opinion. We also show that our proposed scoring system is superior in terms of area under the curve to that developed using multiple logistic regression model selected based on variable significance.
Keywords:area under the curve  group lasso  multiple logistic regression  PRRS  receiver operating characteristic curve  risk scoring system  survey data
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号