首页 | 本学科首页   官方微博 | 高级检索  
     


Comparison of algorithms for replacing missing data in discriminant analysis
Authors:J.Twedt Daniel  D.S. Gill
Affiliation:1. Department of Statistics , North Dakota State University , Fargo, 58105, North Dakota;2. Department of Mathematics , California State Polytechnic University , Pomona, California, 91768
Abstract:We examined the impact of different methods for replacing missing data in discriminant analyses conducted on randomly generated samples from multivariate normal and non-normal distributions. The probabilities of correct classification were obtained for these discriminant analyses before and after randomly deleting data as well as after deleted data were replaced using: (1) variable means, (2) principal component projections, and (3) the EM algorithm. Populations compared were: (1) multivariate normal with covariance matrices ∑1=∑2, (2) multivariate normal with ∑1≠∑2 and (3) multivariate non-normal with ∑1=∑2. Differences in the probabilities of correct classification were most evident for populations with small Mahalanobis distances or high proportions of missing data. The three replacement methods performed similarly but all were better than non - replacement.
Keywords:discriminant analysis  missing data  incomplete data  EM algorithm
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号