一类基于模型的聚类方法 A Kind of Model Based Cluster Analysis期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

一类基于模型的聚类方法

引用本文：	魏瑾瑞.一类基于模型的聚类方法[J].统计与信息论坛,2014(2):19-22.

作者姓名：	魏瑾瑞

作者单位：	东北财经大学统计学院,辽宁大连116025

基金项目：	基金项目：辽宁省社会科学规划基金青年项目《抽样调查方法应用的科学性与适用性研究--基于大数据方法的适用性改进》（L13CTJ004）

摘要：	在聚类问题中,若变量之间存在相关性,传统的应对方法主要是考虑采用马氏距离、主成分聚类等方法,但其可操作性或可解释性较差,因此提出一类基于模型的聚类方法,先对变量间的相关性结构建模(作为辅助信息)再做聚类分析。这种方法的优点主要在于:适用范围更宽泛,不仅能处理(线性)相关问题,而且还可以处理变量间存在的其他复杂结构生成的数据聚类问题;各个变量的重要性也可以通过模型的回归系数来体现;比马氏距离更稳健、更具操作性,比主成分聚类更容易得到解释,算法上也更为简洁有效。
关键词：	聚类分析基于模型虚拟个体辅助信息
A Kind of Model Based Cluster Analysis

WEI Jin-rui.A Kind of Model Based Cluster Analysis[J].Statistics & Information Tribune,2014(2):19-22.

Authors:	WEI Jin-rui

Institution:	WEI Jin-rui (School of Statistics, Dongbei University of Finance and Economies, Dalian 116025, China))

Abstract:	In clustering analysis, if there exist the correlation between variables, we use Mahalanobis distance （not realistic） or principal component cluster （poor interpretability）. In this paper, we put forward a kind of model based clustering method, which modeling the correlation between variables as supplementary information to do cluster analysis. The advantage of this approach lies in：（1） the wider range of application, not only （linear） correlation but also other complex structure; （2） the importance of each variable can also be defined by the model （e. g. , regression coefficient）; （3） more robust and operational than Mahalanobis distance, and more understandable and efficient than principal component cluster.

Keywords:	cluster analysis model based virtual individual supplementary information
本文献已被 CNKI 维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏