Mining data with numerical attributes and missing attribute values — A rough set approach

机译：挖掘具有数字属性和缺少属性值的数据-粗糙集方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper discusses a challenging problem of mining data sets with numerical attributes and, at the same time, with missing attribute values. We distinguish between two interpretations of missing attribute values: lost values and ”do not care” conditions. In our experiments, we used the LERS data mining system, inducing certain and possible rule sets, using rough set theory ideas of lower and upper approximations, respectively. The LERS data mining system has two options for computing approximations: global and local. In our experiments we used both options. Additionally, we used a probabilistic approach to missing attribute values, one of the most successful traditional methods to handle missing attribute values. Using the Wilcoxon matched-pairs signed rank test (5% level of significance for two-tailed test), we observed that the probabilistic approach was either worse or not better than rough set approaches.

机译：本文讨论了一个具有数字属性并且同时缺少属性值的数据集挖掘的难题。我们区分缺少属性值的两种解释：丢失值和“无关”条件。在我们的实验中，我们使用LERS数据挖掘系统，分别使用上下近似的粗糙集理论思想，得出某些和可能的规则集。 LERS数据挖掘系统具有两个用于计算近似值的选项：全局和局部。在我们的实验中，我们同时使用了这两种选择。此外，我们使用概率方法来缺失属性值，这是处理缺失属性值最成功的传统方法之一。使用Wilcoxon配对对的符号秩检验（两尾检验的显着性水平为5％），我们观察到概率方法比粗糙集方法更差或更好。

著录项

来源
《2011 IEEE International Conference on Granular Computing》|2011年|p.214-219|共6页
会议地点
作者
Grzymala-Busse Jerzy W.; Hippe Zdzislaw S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法;
关键词
Data mining; conditions; incomplete data; lost values “do not care”; rough set theory; rule induction algorithm MLEM2;

机译：数据挖掘;条件;不完整数据;丢失值“无关”;粗糙集理论;规则归纳算法MLEM2;

相似文献

外文文献
中文文献
专利

1. Coping with missing attribute values based on closest fit in preterm birth data: a rough set approach [J] . Jerzy W.Grzymala-Busse, Witold J.Grzymala-Busse, Linda K.Goodwin Computational Intelligence . 2001,第3期

机译：基于最接近早产数据的属性值的缺失应对：一种粗糙集方法
2. A rough sets based characteristic relation approach for dynamic attribute generalization in data mining [J] . Tianrui Li, Da Ruan, Wets Geert, Knowledge-Based Systems . 2007,第5期

机译：基于粗糙集的特征关系方法在数据挖掘中的动态属性综合
3. A Generalized Rough Set Approach to Attribute Generalization in Data Mining [J] . Li Tianrui, Xu Yang Journal of Southwest Jiaotong University . 2000,第1期

机译：数据挖掘中属性归纳的广义粗糙集方法
4. Mining data with numerical attributes and missing attribute values — A rough set approach [C] . Grzymala-Busse Jerzy W., Hippe Zdzislaw S. IEEE International Conference on Granular Computing . 2011

机译：使用数值属性和缺少属性值的挖掘数据 - 粗糙集方法
5. Knowledge discovery in databases: An attribute-oriented rough set approach. [D] . Hu, Xiaohua. 1995

机译：数据库中的知识发现：一种面向属性的粗糙集方法。
6. δ-Cut Decision-Theoretic Rough Set Approach: Model and Attribute Reductions [O] . Hengrong Ju, Huili Dou, Yong Qi, -1

机译：δ-Cut决策理论粗糙集方法：模型和属性约简
7. A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set [O] . Renu Vashist, M. L. Garg 2012

机译：数据集属性值缺失规则的生成和验证的粗糙集方法

Mining data with numerical attributes and missing attribute values — A rough set approach

摘要

著录项

相似文献

相关主题

期刊订阅