首页> 外文会议>Computer Science and Information Technology, 2008 International Multiconference on >Application of clustering and association methods in data cleaning
【24h】

Application of clustering and association methods in data cleaning

机译:聚类和关联方法在数据清理中的应用

获取原文

摘要

Data cleaning is a process of maintaining data quality in information systems. Current data cleaning solutions require reference data to identify incorrect or duplicate entries. This article proposes usage of data mining in the area of data cleaning as effective in discovering reference data and validation rules from the data itself. Two algorithms designed by the author for data attribute correction have been presented. Both algorithms utilize data mining methods. Experimental results show that both algorithms can effectively clean text attributes without external reference data.
机译:数据清理是维护信息系统中数据质量的过程。当前的数据清理解决方案需要参考数据来识别不正确或重复的条目。本文提出在数据清理领域中使用数据挖掘可有效地从数据本身中发现参考数据和验证规则。提出了作者设计的两种用于数据属性校正的算法。两种算法都利用数据挖掘方法。实验结果表明,两种算法都可以有效地清除文本属性,而无需外部参考数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号