首页 | 本学科首页   官方微博 | 高级检索  
     


k-POD: A Method for k-Means Clustering of Missing Data
Authors:Jocelyn T. Chi  Eric C. Chi  Richard G. Baraniuk
Abstract:The k-means algorithm is often used in clustering applications but its usage requires a complete data matrix. Missing data, however, are common in many applications. Mainstream approaches to clustering missing data reduce the missing data problem to a complete data formulation through either deletion or imputation but these solutions may incur significant costs. Our k-POD method presents a simple extension of k-means clustering for missing data that works even when the missingness mechanism is unknown, when external information is unavailable, and when there is significant missingness in the data.

[Received November 2014. Revised August 2015.]
Keywords:Clustering  k-means  Imputation  Majorization-minimization  Missing data.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号