首页> 外文会议>IEEE Latin American Conference on Computational Intelligence >A methodology for enhancing data quality for classification purposes using attribute-based decision graphs
【24h】

A methodology for enhancing data quality for classification purposes using attribute-based decision graphs

机译:使用基于属性的决策图提高分类目的数据质量的方法

获取原文

摘要

The accuracy performance of a classification system strongly depends on the quality of the data used to train it. Among other issues, noise in the attribute values degrades data quality and interferes badly with the process of automatic classification. This paper proposes a novel method of data cleansing designed for enhancing classification accuracy. The cleansing procedure is based on the Attribute-based Decision Graphs, which are graphs built over the attribute space of a data set. Such graphs gather the underlying patterns from the data set and use this knowledge to check each attribute value for noise. Classification results considering four learning algorithms and five data sets with artificially added noise have shown the effectiveness of the proposed cleansing procedure.
机译:分类系统的准确性性能在很大程度上取决于用于训练它的数据的质量。除其他问题外,属性值中的噪声会降低数据质量,并严重干扰自动分类的过程。本文提出了一种新的数据清理方法,旨在提高分类精度。清理过程基于基于属性的决策图,这些图是在数据集的属性空间之上构建的图。这样的图形从数据集中收集基础模式,并使用此知识来检查每个属性值是否有噪声。考虑到四种学习算法和五个带有人为添加的噪声的数据集的分类结果表明了所提出的清洁程序的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号