首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Simple Measures of Individual Cluster-Membership Certainty for Hard Partitional Clustering
Authors:Dongmeng Liu  Jinko Graham
Institution:1. Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, CanadaORCID Iconhttps://orcid.org/0000-0002-9466-8225;2. Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, CanadaORCID Iconhttps://orcid.org/0000-0003-4568-1228
Abstract:We propose two probability-like measures of individual cluster-membership certainty that can be applied to a hard partition of the sample such as that obtained from the partitioning around medoids (PAM) algorithm, hierarchical clustering or k-means clustering. One measure extends the individual silhouette widths and the other is obtained directly from the pairwise dissimilarities in the sample. Unlike the classic silhouette, however, the measures behave like probabilities and can be used to investigate an individual’s tendency to belong to a cluster. We also suggest two possible ways to evaluate the hard partition using these measures. We evaluate the performance of both measures in individuals with ambiguous cluster membership, using simulated binary datasets that have been partitioned by the PAM algorithm or continuous datasets that have been partitioned by hierarchical clustering and k-means clustering. For comparison, we also present results from soft-clustering algorithms such as soft analysis clustering (FANNY) and two model-based clustering methods. Our proposed measures perform comparably to the posterior probability estimators from either FANNY or the model-based clustering methods. We also illustrate the proposed measures by applying them to Fisher’s classic dataset on irises.
Keywords:FANNY algorithm  Hard-clustering  Model-based clustering  Silhouette width  Soft-clustering
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号