Evaluation of sampling for data mining of association rules

Abstract

The discovery of association rules is a prototypical problem in data mining. Current algorithms for mining association rules make repeated passes over the database to determine the frequently occurring itemsets (sets of items). For large databases, the I/O overhead of scanning the database can be extremely high. The authors show that random sampling of the transactions in the database is an effective method for finding association rules. Sampling can speed up the mining process by more than an order of magnitude by reducing I/O costs and drastically shrinking the number of transactions to be considered. The sampled database may also be small enough to reside in main memory. Furthermore, they show that sampling can accurately represent the data patterns in the database with high confidence. They experimentally evaluate the effectiveness of sampling on different databases, and study the relationship between the performance, accuracy, and confidence of the chosen sample.
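The idea in the abstract, mining a random sample of transactions instead of the full database, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the brute-force counter, the toy data generator, and the names `frequent_itemsets`, `sample_transactions`, and `make_toy_database` are all assumptions introduced here for illustration, and the item probabilities are made up.

```python
import random
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: relative support} for itemsets of up to max_size items
    whose support is at least min_support (brute-force counting, one pass)."""
    counts = Counter()
    for t in transactions:
        items = sorted(t)
        for k in range(1, max_size + 1):
            for combo in combinations(items, k):
                counts[combo] += 1
    n = len(transactions)
    return {s: c / n for s, c in counts.items() if c / n >= min_support}


def sample_transactions(transactions, fraction, seed=0):
    """Draw a simple random sample of transactions without replacement."""
    rng = random.Random(seed)
    k = max(1, int(len(transactions) * fraction))
    return rng.sample(transactions, k)


def make_toy_database(n, seed=0):
    """Hypothetical synthetic data: bread and butter co-occur in ~40% of rows."""
    rng = random.Random(seed)
    db = []
    for _ in range(n):
        t = set()
        if rng.random() < 0.4:
            t.update({"bread", "butter"})
        if rng.random() < 0.3:
            t.add("milk")
        if rng.random() < 0.1:
            t.add("jam")
        if not t:
            t.add("other")
        db.append(frozenset(t))
    return db


if __name__ == "__main__":
    db = make_toy_database(20000)
    sample = sample_transactions(db, fraction=0.1)
    full = frequent_itemsets(db, min_support=0.2)
    est = frequent_itemsets(sample, min_support=0.2)
    # Compare the support measured on the full database with the sample estimate.
    for itemset in sorted(full):
        print(itemset, round(full[itemset], 3), round(est.get(itemset, 0.0), 3))
```

In this sketch the sample is one tenth of the database, so the counting pass touches one tenth of the transactions; the printed supports show how closely the sample tracks the full data. Itemsets whose true support lies near the threshold are the ones most likely to be misclassified by a sample, which is the accuracy and confidence trade-off the abstract refers to.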

Bibliographic details

Source
Research Issues in Data Engineering, 1997. Proceedings. Seventh International Workshop on | 1997 | pp. 42-50 | 9 pages
Authors
Zaki M.J.; Parthasarathy S.; Wei Li; Ogihara M.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature

1. Mining Association Rules from No-SQL data bases using Map-Reduce Fuzzy Association Rule Mining Algorithm [J]. Chatakunta Praveen Kumar, Pole Anjaiah, Santosh Patil, International Journal of Applied Engineering Research. 2017, No. 21aPta1
2. Performance Evaluation of Algorithms using a Distributed Data Mining Frame Work based on Association Rule Mining [J]. P.T.Kavitha, Dr.T.Sasipraba, International Journal on Computer Science and Engineering. 2011, No. 12
3. Evaluation of rational nonsteroidal anti-inflammatory drugs and gastro-protective agents use; association rule data mining using outpatient prescription patterns [J]. Oraluck Pattanaprateep, Mark McEvoy, John Attia, BMC Medical Informatics and Decision Making. 2017, No. 1
4. Evaluation of sampling for data mining of association rules [C]. Zaki M.J., Parthasarathy S., Institute of Electric and Electronic Engineer International Workshop on Research Issues in Data Engineering. 1997
5. Sampling: An efficient solution for data mining of association rules. [D]. Zhu, Jingbo. 2003
6. Boosting association rule mining in large datasets via Gibbs sampling [O]. Guoqi Qian, Calyampudi Radhakrishna Rao, Xiaoying Sun, 2016
7. Evaluation of Sampling for Data Mining of Association Rules [O]. Mohammed Javeed Zaki, Srinivasan Parthasarathy, Wei Li, 1997
8. Evaluation of Sampling for Data Mining of Association Rules [R]. Zaki, M. J., Parthasarathy, Li, W., 1996
