首页 | 本学科首页   官方微博 | 高级检索  
     


An NMF-framework for Unifying Posterior Probabilistic Clustering and Probabilistic Latent Semantic Indexing
Authors:Zhong-Yuan Zhang  Tao Li  Chris Ding  Jie Tang
Affiliation:1. School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, China;2. School of Computing and Information Sciences, Florida International University, Miami, Florida, USA;3. Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, Texas, USA;4. Department of Computer Science and Technology, Tsinghua University, Beijing, China
Abstract:In document clustering, a document may be assigned to multiple clusters and the probabilities of a document belonging to different clusters are directly normalized. We propose a new Posterior Probabilistic Clustering (PPC) model that has this normalization property. The clustering model is based on Nonnegative Matrix Factorization (NMF) and flexible such that if we use class conditional probability normalization, the model reduces to Probabilistic Latent Semantic Indexing (PLSI). Systematic comparison and evaluation indicates that PPC is competitive with other state-of-art clustering methods. Furthermore, the results of PPC are more sparse and orthogonal, both of which are highly desirable.
Keywords:Posterior probabilistic clustering  Probabilistic latent semantic indexing  NMF-framework
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号