Evaluation of Sampling for Data Mining of Association Rules

机译：关联规则数据挖掘的抽样评价

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Data mining is an emerging research area, whose goal is to extract significantpatterns or interesting rules from large databases. High-level inference from large volumes of routine business data can provide valuable information to businesses, such as customer buying patterns, shelving criterion in supermarkets and stock trends. However, many algorithms proposed for data mining of association rules make repeated passes over the database to determine the commonly occurring itemsets (or set of items). For large databases, the I/O overhead in scanning the database can be extremely high. In this paper we show that random sampling of transactions in the database is an effective method for finding association rules. Sampling can speed up the mining process by more than an order of magnitude by reducing I/O costs and drastically shrinking the number of transaction to be considered. We may also be able to make the sampled database resident in main-memory. Furthermore, we show that sampling can accurately represent the data patterns in the database with high confidence. We experimentally evaluate the effectiveness of sampling on three databases.

著录项

作者
Zaki, M. J.; Parthasarathy; Li, W.; Ogihara, M.;
展开▼
作者单位

展开▼
年度 1996
页码
总页数 17
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Data bases; Statistical samples; Algorithms; Optimization; Data management; Probability distribution functions; Random variables; Statistical inference; Statistical data; Accuracy; Learning machines; Input output processing; Rule based systems; Pattern re;

机译：数据库;统计样本;算法;优化;数据管理;概率分布函数;随机变量;统计推断;统计数据;准确性;学习机器;输入输出处理;基于规则的系统;模式重新;

相似文献

外文文献
中文文献
专利

1. Mining Association Rules from No-SQL data bases using Map-Reduce Fuzzy Association Rule Mining Algorithm [J] . Chatakunta Praveen Kumar, Pole Anjaiah, Santosh Patil, International Journal of Applied Engineering Research . 2017,第21aPta1期

机译：使用地图减少模糊关联规则挖掘算法来自No-SQL数据基础的挖掘关联规则
2. Performance Evaluation of Algorithms using a Distributed Data Mining Frame Work based on Association Rule Mining [J] . P.T.Kavitha, Dr.T.Sasipraba International Journal on Computer Science and Engineering . 2011,第12期

机译：基于关联规则挖掘的分布式数据挖掘框架算法的性能评估
3. Evaluation of rational nonsteroidal anti-inflammatory drugs and gastro-protective agents use; association rule data mining using outpatient prescription patterns [J] . Oraluck Pattanaprateep, Mark McEvoy, John Attia, BMC Medical Informatics and Decision Making . 2017,第1期

机译：评价合理的非甾体抗炎药和胃保护剂的使用;门诊处方模式的关联规则数据挖掘
4. Evaluation of sampling for data mining of association rules [C] . Zaki M.J., Parthasarathy S., Wei Li, Research Issues in Data Engineering, 1997. Proceedings. Seventh International Workshop on . 1997

机译：关联规则数据挖掘的抽样评估
5. Sampling: An efficient solution for data mining of association rules. [D] . Zhu, Jingbo. 2003

机译：采样：一种有效的关联规则数据挖掘解决方案。
6. Boosting association rule mining in large datasets via Gibbs sampling [O] . Guoqi Qian, Calyampudi Radhakrishna Rao, Xiaoying Sun, 2016

机译：通过Gibbs采样促进大型数据集中的关联规则挖掘
7. Evaluation of Sampling for Data Mining of Association Rules [O] . Mohammed Javeed Zaki, Srinivasan Parthasarathy, Wei Li, 1997

机译：关联规则数据挖掘的抽样评价

Evaluation of Sampling for Data Mining of Association Rules

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅