首页> 外文期刊>Expert Systems with Application >On reduction of attributes in inconsistent decision tables based on information entropies and stripped quotient sets
【24h】

On reduction of attributes in inconsistent decision tables based on information entropies and stripped quotient sets

机译:基于信息熵和去除商集的不一致决策表中的属性约简

获取原文
获取原文并翻译 | 示例
       

摘要

Data scenarios on nowadays comprise an enormous number of attributes and instances while not all attributes are necessary and useful for data analytics and knowledge extraction in framing expert and intelligent systems. In such scenarios, removing irrelevant and redundant attributes is a very important task and a challenging research problem. In rough set approach, the attribute reduction method based on the positive region has reported the promising performance in consistent decision tables, while conditional information entropies are employed instead for inconsistent decision tables. However, the complexity of existing methods is still quite high. This paper proposes efficient attribute reduction approaches in inconsistent decision tables by using the concept of stripped quotient sets. Firstly, we present a method for fast determining reducts based on the conditional information entropies including Shannon's and Liang's entropies, called ERED-SQS. Secondly, we define an original notion of attribute reduction based on the stripped quotient set and propose an efficient SRED algorithm to cope with this reduct type. Finally, a simulation performed on 16 datasets varying in a number of objects, up to 5 millions, and a number of attributes, up to 20 thousands, to validate the proposed methods in comparison with the state-of-the-art methods. Experimental results show that the proposed algorithms, ERED-SQS and SRED, have significantly improved the computational time, especially for large data sets. Moreover, the classification accuracy of the reducts obtained from SRED algorithm outperforms that of the reducts based on Shannons and Liangs conditional entropies. This verifies that our proposal is applicable and effective to applications regarding attribute reduction in inconsistent decision tables. (C) 2019 Elsevier Ltd. All rights reserved.
机译:当今的数据场景包含大量的属性和实例,而并非所有属性对于框架专家和智能系统中的数据分析和知识提取都是必需和有用的。在这种情况下,删除不相关和多余的属性是一项非常重要的任务,也是一个具有挑战性的研究问题。在粗糙集方法中,基于正区域的属性约简方法在一致的决策表中报告了有希望的性能,而对于不一致的决策表则使用条件信息熵。但是,现有方法的复杂度仍然很高。本文通过去除商集的概念,提出了在不一致的决策表中有效的属性约简方法。首先,我们提出一种基于条件信息熵(包括香农和梁氏熵)的快速确定还原的方法,称为ERED-SQS。其次,我们基于剥离的商集定义了属性约简的原始概念,并提出了一种有效的SRED算法来应对这种归约类型。最后,对16个数据集进行了模拟,这些数据集包含多达500万个对象和多达2万个属性,这些对象与最新方法相比,可验证所提出的方法。实验结果表明,所提出的算法ERED-SQS和SRED显着改善了计算时间,特别是对于大型数据集。此外,从SRED算法获得的归约词的分类准确性优于基于Shannons和Liangs条件熵的归约词的分类准确性。这证明我们的建议适用于在决策表不一致的情况下进行属性约简的应用。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号