首页 | 本学科首页   官方微博 | 高级检索  
     

可扩展数据清理软件平台的研究
引用本文:陈伟,丁秋林. 可扩展数据清理软件平台的研究[J]. 电子科技大学学报(社会科学版), 2006, 0(1)
作者姓名:陈伟  丁秋林
作者单位:南京审计学院信息科学学院 南京210029(陈伟),南京航空航天大学计算机应用研究所 南京210016(丁秋林)
摘    要:提出一种可扩展的数据清理软件平台,该软件平台具有开放的规则库和算法库,规则库用来存放清理规则,算法库用来存放清理算法,算法库中包含多种算法,并可对其扩展;通过在规则库中定义清理规则以及从算法库中选择合适的清理算法,可使该软件平台适用于不同的数据源,从而使其具有较强的通用性和适应性;通过多种算法的清理,提高了数据清理的综合效果。最后,通过实例验证了该平台的效果及可行性。

关 键 词:数据清理  软件平台  规则库  算法库

Study on the Extensible Data Cleaning Software Platform
CHEN Wei,DING Qiu-lin. Study on the Extensible Data Cleaning Software Platform[J]. Journal of University of Electronic Science and Technology of China(Social Sciences Edition), 2006, 0(1)
Authors:CHEN Wei  DING Qiu-lin
Affiliation:CHEN Wei1,DING Qiu-lin1
Abstract:An extensible data cleaning software platform is proposed, which has open rules library and algorithms library. Rules library is used to store rules and algorithms library is used to store algorithms. Algorithms library has many algorithms and can be extended. Through defining rules in rules library and choosing proper cleaning algorithms from algorithms library, the software platform can be used to various data sources, which makes it universal and adaptive. The synthetic result is improved through data cleaning with many algorithms. Finally, the effect and feasibility of this extensible data cleaning software platform is proved through an example.
Keywords:data cleaning  software platform  rules library  algorithms library
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号