Mining incomplete data with many attribute-concept values and 'do not care' conditions

机译：挖掘具有许多属性概念值和“无关”条件的不完整数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present novel experimental results comparing two interpretations of missing attribute values: attribute-concept values and "do not care" conditions. Experiments were conducted on 12 data sets with many missing attribute values using the MLEM2 rule induction system. In the experiments, three kinds of probabilistic approximations were used: singleton, subset and concept; with the error rate of the induced rules evaluated by ten-fold cross validation. The results of the experiments compared two interpretations of missing values, attribute-concept values and "do not care" conditions, finding the best result among the three probabilistic approximations. The outcomes show that for two cases the better performance was accomplished using attribute-concept values, for one case the better performance was accomplished using "do not care" conditions. For remaining three cases the difference in performance was not statistically significant (5% significance level).

机译：在本文中，我们提供了新颖的实验结果，比较了缺失属性值的两种解释：属性概念值和“无关”条件。使用MLEM2规则归纳系统对12个具有许多缺失属性值的数据集进行了实验。在实验中，使用了三种概率近似：单例，子集和概念;通过十次交叉验证评估得出的规则的错误率。实验结果比较了缺失值，属性概念值和“无关”条件的两种解释，在三种概率近似中找到了最佳结果。结果表明，在两种情况下，使用属性概念值可以实现更好的性能，在一种情况下，使用“不在乎”条件可以实现更好的性能。对于其余三个案例，绩效差异在统计学上不显着（显着性水平为5％）。

著录项

来源
《IEEE International Congress on Big Data》|2015年|1597-1602|共6页
会议地点
作者
Clark Patrick G.; Grzymala-Busse Jerzy W.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
"do not care" conditions; Data mining; MLEM2 rule induction algorithm; attribute-concept values; probabilistic approximations; rough set theory;

机译：“无关”条件;数据挖掘; MLEM2规则归纳算法;属性概念值;概率逼近;粗糙集理论;

相似文献

外文文献
中文文献
专利

1. Outlier Mining in Medical Databases: An Application of Data Mining in Health Care Management to Detect Abnormal Values Presented In Medical Databases [J] . Varun Kumar, Dharminder Kumar, R.K. Singh International journal of computer science and network security . 2008,第8期

机译：医学数据库中的异常值挖掘：数据挖掘在医疗保健管理中的应用，以检测医学数据库中出现的异常值
2. RMINE: A rough set based data mining prototype for the reasoning of incomplete data in condition-based fault diagnosis [J] . JING RONG LI, LI PHENG KHOO, SHU BENG TOR Journal of Intelligent Manufacturing . 2006,第1期

机译：RMINE：基于粗糙集的数据挖掘原型，用于基于条件的故障诊断中不完整数据的推理
3. Ontology for Data Mining and its Application to Mining Incomplete Data [J] . Hai Wang, Shouhong Wang Journal of database management . 2008,第4期

机译：数据挖掘本体及其在不完整数据挖掘中的应用
4. Mining incomplete data with many attribute-concept values and "do not care" conditions [C] . Clark Patrick G., Grzymala-Busse Jerzy W. IEEE International Congress on Big Data . 2015

机译：使用许多属性概念值和“不关心”条件的挖掘不完整的数据
5. Data Mining Compressed, Incomplete and Inaccurate High Dimensional Data [D] . Hunter, Blake 2011

机译：数据挖掘压缩，不完整和不正确的高维数据
6. A Gaussian process model and Bayesian variable selection for mapping function-valued quantitative traits with incomplete phenotypic data [O] . Jarno Vanhatalo, Zitong Li, Mikko J Sillanpää -1

机译：高斯过程模型和贝叶斯变量选择用于映射具有不完整表型数据的函数值定量性状
7. A NOVEL ALGORITHM FOR ASSOCIATION RULE MINING FROM DATA WITH INCOMPLETE AND MISSING VALUES [O] . K. Rameshkumar 2011

机译：一种新的算法，用于从数据中挖掘不完整和缺失的关联规则

Mining incomplete data with many attribute-concept values and 'do not care' conditions

摘要

著录项

相似文献

相关主题

期刊订阅