Mining incomplete data with many attribute-concept values and 'do not care' conditions


Abstract

In this paper we present novel experimental results comparing two interpretations of missing attribute values: attribute-concept values and "do not care" conditions. Experiments were conducted on 12 data sets with many missing attribute values using the MLEM2 rule induction system. Three kinds of probabilistic approximations were used in the experiments: singleton, subset and concept. The error rate of the induced rules was evaluated by ten-fold cross validation. The two interpretations of missing values were compared using the best result among the three probabilistic approximations. In two cases, attribute-concept values gave the better performance; in one case, "do not care" conditions gave the better performance. In the remaining three cases, the difference in performance was not statistically significant (5% significance level).
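The abstract names the two interpretations and the three approximations without defining them. The sketch below follows the definitions commonly given in the rough-set literature on incomplete data. It is an illustration under stated assumptions, not the MLEM2 system or the authors' code. The toy table, the attribute names, the symbols `*` ("do not care") and `-` (attribute-concept value), and all function names are hypothetical. A "do not care" value is treated as matching every specified value of the attribute. An attribute-concept value matches only the values seen among cases with the same decision.

```python
from fractions import Fraction

DONT_CARE = "*"      # assumed symbol for a "do not care" condition
ATTR_CONCEPT = "-"   # assumed symbol for an attribute-concept value
MISSING = {DONT_CARE, ATTR_CONCEPT}


def concept_values(cases, i, a, decision):
    """Specified values of attribute a among cases sharing case i's decision."""
    return {c[a] for c in cases
            if c[decision] == cases[i][decision] and c[a] not in MISSING}


def attribute_value_blocks(cases, attrs, decision):
    """Map (attribute, value) to the set of case indices that match it."""
    blocks = {}
    for a in attrs:
        specified = {c[a] for c in cases if c[a] not in MISSING}
        for v in specified:
            blocks[(a, v)] = {i for i, c in enumerate(cases) if c[a] == v}
        for i, c in enumerate(cases):
            if c[a] == DONT_CARE:
                targets = specified            # matches every known value
            elif c[a] == ATTR_CONCEPT:
                targets = concept_values(cases, i, a, decision)
            else:
                continue
            for v in targets:
                blocks[(a, v)].add(i)
    return blocks


def characteristic_sets(cases, attrs, decision):
    """K(x): cases indistinguishable from x given what is known about x."""
    universe = set(range(len(cases)))
    blocks = attribute_value_blocks(cases, attrs, decision)
    ksets = []
    for i, c in enumerate(cases):
        k = set(universe)
        for a in attrs:
            if c[a] == DONT_CARE:
                continue                       # no restriction from this attribute
            if c[a] == ATTR_CONCEPT:
                values = concept_values(cases, i, a, decision)
                if values:
                    k &= set().union(*(blocks[(a, v)] for v in values))
            else:
                k &= blocks[(a, c[a])]
        ksets.append(k)
    return ksets


def probabilistic_approximations(concept, ksets, alpha):
    """Return the (singleton, subset, concept) approximations for threshold alpha."""
    good = [Fraction(len(concept & k), len(k)) >= alpha for k in ksets]
    singleton = {x for x, ok in enumerate(good) if ok}
    subset = set().union(*(k for k, ok in zip(ksets, good) if ok))
    concept_approx = set().union(
        *(k for x, (k, ok) in enumerate(zip(ksets, good)) if ok and x in concept))
    return singleton, subset, concept_approx


# Hypothetical toy table: two attributes and a decision "flu".
CASES = [
    {"temperature": "high",   "headache": "yes", "flu": "yes"},
    {"temperature": "-",      "headache": "yes", "flu": "yes"},
    {"temperature": "*",      "headache": "no",  "flu": "no"},
    {"temperature": "high",   "headache": "*",   "flu": "no"},
    {"temperature": "normal", "headache": "no",  "flu": "no"},
]
ATTRS = ["temperature", "headache"]
KSETS = characteristic_sets(CASES, ATTRS, "flu")
FLU = {i for i, c in enumerate(CASES) if c["flu"] == "yes"}
RESULT_HALF = probabilistic_approximations(FLU, KSETS, Fraction(1, 2))
```

On this toy table with threshold 1/2, the subset approximation comes out larger than the singleton and concept approximations. This shows how the three kinds can yield different sets of cases from the same characteristic sets, and therefore different input for rule induction.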


Bibliographic details

Source
IEEE International Congress on Big Data | 2015 | 6 pages
Authors
Clark, Patrick G.; Grzymala-Busse, Jerzy W.
Original format: PDF
Chinese Library Classification: programming, software engineering
Keywords
"do not care" conditions; data mining; MLEM2 rule induction algorithm; attribute-concept values; probabilistic approximations; rough set theory


