首页> 外文会议>International Joint Conference on Rough Sets >Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data
【24h】

Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data

机译:挖掘不完全数据中的特征集和广义最大一致块

获取原文

摘要

Mining incomplete data using approximations based on characteristic sets is a well-established technique. It is applicable to incomplete data sets with a few interpretations of missing attribute values, e.g., lost values and "do not care" conditions. Typically, probabilistic approximations are used in the process. On the other hand, maximal consistent blocks were introduced for incomplete data sets with only "do not care" conditions, using only lower and upper approximations. In this paper we introduce an extension of the maximal consistent blocks to incomplete data sets with any interpretation of missing attribute values and with probabilistic approximations. Additionally, we present results of experiments on mining incomplete data using both characteristic sets and maximal consistent blocks, using lost values and "do not care" conditions. We show that there is a small difference in quality of rule sets induced either way. However, characteristic sets can be computed in polynomial time while computing maximal consistent blocks is associated with exponential time complexity.
机译:使用基于特征集的近似的挖掘不完整的数据是一种良好的技术。它适用于不完整的数据集,具有缺少属性值的少数解释,例如丢失的值和“不关心”条件。通常,在该过程中使用概率逼近。另一方面,仅使用较低和上近似为“不关心”条件的不完全数据集引入最大一致块。在本文中,我们将最大一致块的扩展引入了不完整的数据集,其中包含缺少属性值和概率近似的任何解释。此外,我们使用损失值和“不关心”条件,使用两种特征集和最大一致块的挖掘不完全数据进行实验的结果。我们表明规则集的质量差异很大,诱导了任何一种方式。然而,可以在多项式时间中计算特征集,同时计算最大一致块与指数时间复杂相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号