首页 | 本学科首页   官方微博 | 高级检索  
     


A cautionary case study of approaches to the treatment of missing data
Authors:Christopher Paul  William M. Mason  Daniel McCaffrey  Sarah A. Fox
Affiliation:(1) RAND, 4570 Fifth Ave., Suite 600, Pittsburgh, PA 15213, USA;(2) California Center for Population Research, University of California, Los Angeles, 4284 Public Policy Building, PO Box 951484, Los Angeles, CA 90095, USA;(3) Department of Medicine, Division of General Internal Medicine and Health Services Research, University of California, Los Angeles, 1100 Glendon Ave., Suite 2010, Los Angeles, CA 90024, USA
Abstract:This article presents findings from a case study of different approaches to the treatment of missing data. Simulations based on data from the Los Angeles Mammography Promotion in Churches Program (LAMP) led the authors to the following cautionary conclusions about the treatment of missing data: (1) Automated selection of the imputation model in the use of full Bayesian multiple imputation can lead to unexpected bias in coefficients of substantive models. (2) Under conditions that occur in actual data, casewise deletion can perform less well than we were led to expect by the existing literature. (3) Relatively unsophisticated imputations, such as mean imputation and conditional mean imputation, performed better than the technical literature led us to expect. (4) To underscore points (1), (2), and (3), the article concludes that imputation models are substantive models, and require the same caution with respect to specificity and calculability. The research reported here was partially supported by National Institutes of Health, National Cancer Institute, R01 CA65879 (SAF). We thank Nicholas Wolfinger, Naihua Duan, John Adams, John Fox, and the anonymous referees for their thoughtful comments on earlier drafts. The responsibility for any remaining errors is ours alone. Benjamin Stein was exceptionally helpful in orchestrating the simulations at the labs of UCLA Social Science Computing. Michael Mitchell of the UCLA Academic Technology Services Statistical Consulting Group artfully created Fig. 1 using the Stata graphics language; we are most grateful.
Keywords:Missing data  Imputation  Multiple imputation  Casewise deletion
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号