Characteristic Sets and Generalized Maximal Consistent Blocks in Mining Incomplete Data

Abstract

Mining incomplete data using approximations based on characteristic sets is a well-established technique. It is applicable to incomplete data sets with a few interpretations of missing attribute values, e.g., lost values and "do not care" conditions. Typically, probabilistic approximations are used in the process. On the other hand, maximal consistent blocks were introduced for incomplete data sets with only "do not care" conditions, using only lower and upper approximations. In this paper we introduce an extension of the maximal consistent blocks to incomplete data sets with any interpretation of missing attribute values and with probabilistic approximations. Additionally, we present results of experiments on mining incomplete data using both characteristic sets and maximal consistent blocks, using lost values and "do not care" conditions. We show that there is a small difference in quality of rule sets induced either way. However, characteristic sets can be computed in polynomial time while computing maximal consistent blocks is associated with exponential time complexity.
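
As a rough illustration of why characteristic sets can be computed in polynomial time, the sketch below follows the definition commonly used in the rough set literature, which the abstract itself does not spell out. A case with a lost value for an attribute is left out of every block of that attribute, a case with a "do not care" condition is placed in every block of that attribute, and the characteristic set of a case is the intersection of the blocks for its specified values. The markers `?` and `*`, the function names, and the toy table are assumptions made for this example and are not taken from the paper.

```python
from typing import Dict, List, Set, Tuple

LOST = "?"         # assumed marker for a lost value
DO_NOT_CARE = "*"  # assumed marker for a "do not care" condition

Table = List[Dict[str, str]]


def attribute_value_blocks(table: Table, attributes: List[str]) -> Dict[Tuple[str, str], Set[int]]:
    """Build the block [(a, v)] for every attribute a and specified value v."""
    blocks: Dict[Tuple[str, str], Set[int]] = {}
    for a in attributes:
        specified = {row[a] for row in table if row[a] not in (LOST, DO_NOT_CARE)}
        for v in specified:
            # Cases with "*" join every block of a; cases with "?" join none.
            blocks[(a, v)] = {
                i for i, row in enumerate(table)
                if row[a] == v or row[a] == DO_NOT_CARE
            }
    return blocks


def characteristic_sets(table: Table, attributes: List[str]) -> List[Set[int]]:
    """Return the characteristic set of each case, as a set of case indices."""
    universe = set(range(len(table)))
    blocks = attribute_value_blocks(table, attributes)
    result: List[Set[int]] = []
    for row in table:
        k = set(universe)
        for a in attributes:
            # A missing value (either kind) contributes the whole universe,
            # so only specified values narrow the intersection.
            if row[a] not in (LOST, DO_NOT_CARE):
                k &= blocks[(a, row[a])]
        result.append(k)
    return result


# Hypothetical toy data set with one lost value and one "do not care" condition.
example_table: Table = [
    {"Temperature": "high", "Headache": "?"},
    {"Temperature": "*", "Headache": "yes"},
    {"Temperature": "normal", "Headache": "no"},
    {"Temperature": "high", "Headache": "yes"},
]

if __name__ == "__main__":
    for i, k in enumerate(characteristic_sets(example_table, ["Temperature", "Headache"])):
        print(i, sorted(k))
```

Every step here is a plain loop over cases and attributes, which is consistent with the polynomial bound stated in the abstract. Enumerating maximal consistent blocks has no comparably direct construction, and this sketch does not attempt it.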

Bibliographic details

Source
International Joint Conference on Rough Sets | 2017 | 693p | 10 pages
Authors
Patrick G. Clark; Cheng Gao; Jerzy W. Grzymala-Busse; Teresa Mroczek
Original format PDF
Chinese Library Classification TP18-53
Keywords
Incomplete data; Lost values; "Do not care" conditions; Characteristic sets; Maximal consistent blocks; MLEM2 rule induction algorithm; Probabilistic approximations

Similar literature

1. Characteristic sets and generalized maximal consistent blocks in mining incomplete data [J]. Clark Patrick G., Gao Cheng, Grzymala-Busse Jerzy W. Information Sciences: An International Journal. 2018
2. Complexity of Rule Sets Mined from Incomplete Data Using Probabilistic Approximations Based on Generalized Maximal Consistent Blocks [J]. Patrick G. Clark, Jerzy W. Grzymala-Busse, Zdzislaw S. Hippe. Procedia Computer Science. 2020, No. 5
3. Maximal consistent block technique for rule acquisition in incomplete information systems [J]. Leung Y., Li DY. Information Sciences: An International Journal. 2003, No. 0
4. Complexity of Rule Sets in Mining Incomplete Data Using Characteristic Sets and Generalized Maximal Consistent Blocks [C]. Patrick G. Clark, Cheng Gao, Jerzy W. Grzymala-Busse. International conference on hybrid artificial intelligent systems. 2018
5. Incomplete data mining: A rough set approach [D]. Ajayi, Temidayo B. 2007
6. SiBIC: A Web Server for Generating Gene Set Networks Based on Biclusters Obtained by Maximal Frequent Itemset Mining [O]. Kei-ichiro Takahashi, Ichigaku Takigawa, Hiroshi Mamitsuka
7. Generalized Matrix Factorizations as a Unifying Framework for Pattern Set Mining: Complexity Beyond Blocks [O]. Miettinen, P. 2015
