首页> 外文会议>2011 IEEE International Conference on Granular Computing >Mining data with numerical attributes and missing attribute values — A rough set approach
【24h】

Mining data with numerical attributes and missing attribute values — A rough set approach

机译:挖掘具有数字属性和缺少属性值的数据-粗糙集方法

获取原文

摘要

This paper discusses a challenging problem of mining data sets with numerical attributes and, at the same time, with missing attribute values. We distinguish between two interpretations of missing attribute values: lost values and ”do not care” conditions. In our experiments, we used the LERS data mining system, inducing certain and possible rule sets, using rough set theory ideas of lower and upper approximations, respectively. The LERS data mining system has two options for computing approximations: global and local. In our experiments we used both options. Additionally, we used a probabilistic approach to missing attribute values, one of the most successful traditional methods to handle missing attribute values. Using the Wilcoxon matched-pairs signed rank test (5% level of significance for two-tailed test), we observed that the probabilistic approach was either worse or not better than rough set approaches.
机译:本文讨论了一个具有数字属性并且同时缺少属性值的数据集挖掘的难题。我们区分缺少属性值的两种解释:丢失值和“无关”条件。在我们的实验中,我们使用LERS数据挖掘系统,分别使用上下近似的粗糙集理论思想,得出某些和可能的规则集。 LERS数据挖掘系统具有两个用于计算近似值的选项:全局和局部。在我们的实验中,我们同时使用了这两种选择。此外,我们使用概率方法来缺失属性值,这是处理缺失属性值最成功的传统方法之一。使用Wilcoxon配对对的符号秩检验(两尾检验的显着性水平为5%),我们观察到概率方法比粗糙集方法更差或更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号