A majority rules approach to data mining

Abstract

Knowledge discovery in databases (KDD) offers a methodology for developing tools to extract meaningful knowledge from large volumes of data. We propose a generalized KDD model for supervised training. A main step in this process, data mining, involves the creation of a classification structure that is representative of the concept classes identified in the data set. Data mining incorporates learning, which may be supervised or unsupervised, and often uses statistical as well as heuristic (machine learning) techniques. Previous research has shown that different supervised models perform better under certain conditions. We tested the extent of overlap of instance classifications between five supervised models in two real-world domains. Experimental results showed that in one domain all five models classified 75.8% of the instances identically, whether correctly or incorrectly. In the second domain, the corresponding figure was 63.3%. The amount of agreement between models can be used to help determine the nature of the domain and the applicability of a supervised learning approach. We extend the above experimental result and propose a multi-model majority rules (MR) data mining technique to learn about the nature of a given domain. We conclude with directions for future work.
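
The agreement figure described in the abstract (the share of instances that all models classify identically, regardless of correctness) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the MR procedure, so the strict-majority vote below is only one plausible reading, and the function names `unanimous_agreement` and `majority_vote` as well as the sample labels are hypothetical.

```python
from collections import Counter


def unanimous_agreement(predictions):
    """Fraction of instances to which every model assigns the same label.

    `predictions` is a list with one label sequence per model; all
    sequences cover the same instances in the same order.
    """
    n_instances = len(predictions[0])
    same = sum(1 for labels in zip(*predictions) if len(set(labels)) == 1)
    return same / n_instances


def majority_vote(predictions):
    """Per-instance label chosen by a strict majority of models.

    Returns None for an instance when no label is chosen by more than
    half of the models (an assumption; the source does not say how
    such cases are handled).
    """
    n_models = len(predictions)
    result = []
    for labels in zip(*predictions):
        label, count = Counter(labels).most_common(1)[0]
        result.append(label if count > n_models / 2 else None)
    return result


# Hypothetical labels from five models on six instances (illustration only).
preds = [
    ["a", "b", "a", "b", "a", "c"],
    ["a", "b", "a", "a", "a", "b"],
    ["a", "b", "b", "b", "a", "a"],
    ["a", "b", "a", "b", "a", "c"],
    ["a", "b", "a", "a", "b", "b"],
]

agreement = unanimous_agreement(preds)
votes = majority_vote(preds)
print(f"unanimous agreement: {agreement:.1%}")
print("majority labels:", votes)
```

In this toy data only the first two instances are classified identically by all five models, and the last instance has no strict majority, so it is left undecided.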

Bibliographic details

Source
Intelligent Information Systems, 1997. IIS '97. Proceedings | 1997 | pp. 100-107 | 8 pages
Authors
Roiger R.J.; Azarbod C.; Sant R.R.
Original format: PDF
Classification (Chinese Library Classification): Radio electronics, telecommunications technology

