
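Hierarchical Clustering Algorithm Based on Distribution Model
Cite this article: Ye Mao, Cheng Yong. Hierarchical Clustering Algorithm Based on Distribution Model[J]. Journal of University of Electronic Science and Technology of China (Social Sciences Edition), 2004, 0(2)
Authors: Ye Mao (叶茂), Cheng Yong (陈勇)
Affiliations: School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054 (Ye Mao); School of Economics, Shenzhen University, Shenzhen, Guangdong 518060 (Cheng Yong)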
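Abstract:
A new hierarchical clustering algorithm is proposed. The dataset is first sampled; each sample point then acts as a center that absorbs the data points in its neighborhood to form a subcluster; hierarchical clustering is then carried out according to whether subclusters intersect. For the hierarchical clustering stage, the distance measure between clusters is redefined, and a heap structure is built on this measure. Using the idea of estimating the overall distribution of the data points, the algorithm is proved to approach the optimal solution. Experimental results show that its clustering quality is far better than that of existing clustering algorithms.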

Keywords: clustering  data mining  pattern recognition  distribution

Hierarchical Clustering Algorithm Based on Distribution Model
Ye Mao,Cheng Yong. Hierarchical Clustering Algorithm Based on Distribution Model[J]. Journal of University of Electronic Science and Technology of China(Social Sciences Edition), 2004, 0(2)
Authors:Ye Mao  Cheng Yong
Affiliation:Ye Mao1,Cheng Yong2
Abstract:
A novel agglomerative clustering method is proposed. The algorithm consists of three steps: it first samples the dataset, then forms subclusters by absorbing the points in the neighborhoods of the sample points, and finally constructs the clusters by combining the subclusters. The distance measure between two clusters is redefined, and a heap structure is built on this measure. A theoretical explanation of the algorithm is given, based on approximating the actual distribution of the data. Experimental results show that the clustering quality of ADA is much better than that of the well-known algorithm CURE.
Keywords:clustering  data mining  pattern recognition  distribution
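
The three steps named in the abstract (sample, absorb neighbors into subclusters, combine subclusters) can be illustrated with a minimal Python sketch. It is not the authors' ADA implementation: the function name `sketch_cluster`, the parameters `n_samples` and `radius`, the reading of "intersect" as "share at least one point", and the label -1 for points that no sample absorbs are all assumptions made for illustration. The redefined cluster distance and the heap structure are not reproduced, because the abstract does not define them.

```python
import math
import random


def sketch_cluster(points, n_samples, radius, seed=0):
    """Return one label per point; -1 marks points no sample absorbed.

    Hypothetical illustration only, not the published algorithm.
    """
    rng = random.Random(seed)

    # Step 1: sample the dataset (indices of the chosen sample points).
    centers = rng.sample(range(len(points)), n_samples)

    # Step 2: each sample point absorbs every point within `radius`
    # (assumed Euclidean neighborhood) to form a subcluster.
    subclusters = [
        {i for i, p in enumerate(points) if math.dist(p, points[c]) <= radius}
        for c in centers
    ]

    # Step 3: combine subclusters that intersect, using union-find.
    parent = list(range(n_samples))

    def find(a):
        # Follow parent links to the root, compressing the path on the way.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for a in range(n_samples):
        for b in range(a + 1, n_samples):
            if subclusters[a] & subclusters[b]:  # shared point => merge
                parent[find(a)] = find(b)

    # Label every absorbed point with the root of its subcluster.
    labels = [-1] * len(points)
    for s, members in enumerate(subclusters):
        root = find(s)
        for i in members:
            labels[i] = root
    return labels
```

Calling `sketch_cluster(points, n_samples=6, radius=0.6)` on two well-separated groups of three points each assigns one label per group. The pairwise intersection check is quadratic in the number of samples, which is acceptable for a sketch but is presumably where the paper's heap-based merging would differ.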
This article is indexed in CNKI and other databases.