A comparative study of the K-means algorithm and the normal mixture model for clustering: Univariate case |
| |
Authors: | Dingxi Qiu Ajit C Tamhane |
| |
Institution: | Department of Industrial Engineering & Management Sciences, Northwestern University, Evanston, IL 60208, USA |
| |
Abstract: | This paper gives a comparative study of the K-means algorithm and the mixture model (MM) method for clustering normal data. The EM algorithm is used to compute the maximum likelihood estimators (MLEs) of the parameters of the MM model. These parameters include mixing proportions, which may be thought of as the prior probabilities of different clusters; the maximum posterior (Bayes) rule is used for clustering. Hence, asymptotically the MM method approaches the Bayes rule for known parameters, which is optimal in terms of minimizing the expected misclassification rate (EMCR). |
| |
Keywords: | 62H30 62F10 |
本文献已被 ScienceDirect 等数据库收录! |
|