首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于粒度计算的分类属性数据离群点检测算法
引用本文:刘晓平.基于粒度计算的分类属性数据离群点检测算法[J].广州市财贸管理干部学院学报,2014(4):75-78.
作者姓名:刘晓平
作者单位:衢州职业技术学院信息工程学院,浙江衢州324000
基金项目:衢州职业技术学院2013年度院级科研项目“基于覆盖粒计算的海量数据挖掘模型与方法研究”(QZYZ1308)
摘    要:针对基于距离的离群检测算法无法有效应用于分类属性数据集,本文提出一种基于粒度计算理论的对象离群程度计算公式。基于该公式所计算的对象的离群因子值,对所有对象进行排序,将排序后的前k个对象声明为离群点。为了使用相对简单的方法从分类属性数据集中查找离群点,文中构造了一个算法ODAGr C(Outlier detection algorithm based on granular computing)。理论分析和应用实例证明了ODAGr C算法的有效性和可行性。

关 键 词:离群检测  粗糙集  粒度计算  分类信息系统  粒集

Outlier Detection Algorithm for Categorical Data Based on Granular Computing
LIU Xiao-ping.Outlier Detection Algorithm for Categorical Data Based on Granular Computing[J].Journal of Guangzhou Finance & Trade Management Institute,2014(4):75-78.
Authors:LIU Xiao-ping
Institution:LIU Xiao-ping (College of Information Engineering, Quzhou College of Technology, Quzhou 324000, China)
Abstract:Many distance-based outlier detection algorithms were proposed in the past,which can not effectively deal with categorical data set. In this paper,a novel formulation is proposed for the outlier degree of objects that is based on the granular computing theory. Each object on the basis of its Outlier Factor is ranked and we declare the top k objects in this ranking to be outliers. In order to develop relatively straightforward solutions to finding outliers from categorical data set,an algorithm is constructed,named ODAGr C( Outlier detection algorithm based on granular computing). Theory analysis and example calculation both manifest that the ODAGr C is efficient and feasible.
Keywords:outlier detection  rough set  granular computing  categorical information system  granular set
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号