Developing Novel and Effective Approach for Association Rule Mining Using Progressive Sampling

机译：使用渐进采样开发新颖有效的关联规则挖掘方法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A challenging task in data mining is the process of discovering association rules from a large database. Most of the existing association rule mining algorithms make repeated passes over the entire database to determine the frequent itemsets, which is likely to incur an extremely high I/O overhead. A simple but an effective way to overcome this problem is to sample the database, such that, it produces rules with highest achievable accuracy on the large database. Numerous researchers have proposed sampling approaches for faster and efficient mining of association rules. In this paper, we propose a novel and effective progressive sampling-based approach for mining association rules from a large database. Initially, the frequent patterns are extracted using Apriori algorithm from an initial sample that is selected based on the temporal characteristics and the size of the database. Using the frequent itemsets generated, the negative border of the initial sample is obtained and sorted. Subsequently, the midpoint itemset in the sorted negative border is scanned in the concrete database to check if it is frequent. Based on the support level computed for the midpoint itemset, the sample size is either progressively increased for determining an optimal sample or association rules are mined by considering it as an optimal sample. The experimental results demonstrate the efficiency of the proposed progressive sampling approach in effective mining of association rules.

机译：数据挖掘中的一项艰巨任务是从大型数据库中发现关联规则的过程。现有的大多数关联规则挖掘算法中，大多数都会对整个数据库进行反复遍历以确定频繁的项目集，这很可能会导致极高的I / O开销。克服此问题的一种简单而有效的方法是对数据库进行采样，以使其在大型数据库上生成具有最高可达到的准确性的规则。许多研究人员提出了采样方法，以更快，更有效地挖掘关联规则。在本文中，我们提出了一种新颖有效的基于渐进采样的方法，用于从大型数据库中挖掘关联规则。最初，使用Apriori算法从基于时间特征和数据库大小选择的初始样本中提取频繁模式。使用生成的频繁项集，可以获取并排序初始样本的负边界。随后，在具体数据库中扫描排序后的负边框中的中点项目集，以检查其是否频繁。基于为中点项目集计算的支持水平，可以逐渐增加样本大小以确定最佳样本，或者通过将关联规则视为最佳样本来挖掘关联规则。实验结果证明了在有效挖掘关联规则中所提出的渐进采样方法的效率。

著录项

来源
《International Conference on Computer and Electrical Engineering;ICCEE '09》|2009年|610-614|共5页
会议地点
作者
Umarani V.; Punithavalli M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Apriori; Association Rule Mining (ARM); Data mining; Frequent Patterns; Negative border; Progressive sampling; Sampling; Temporal;

机译：先验;关联规则挖掘（ARM）;数据挖掘;频繁模式;负边界;渐进采样;采样;时态;

相似文献

外文文献
中文文献
专利

1. Towards an effective automatic query expansion process using an association rule mining approach [J] . Chiraz Latiri, Hatcm Haddad, Tarek Hamrouni Journal of Intelligent Information Systems . 2012,第1期

机译：使用关联规则挖掘方法实现有效的自动查询扩展过程
2. Using the fuzzy weighted association rule mining approach to develop a customer satisfaction product form [J] . Kang Xinhui, Porter Caroline Samantha, Bohemia Erik Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第4Pta2期

机译：使用模糊加权关联规则挖掘方法来开发客户满意度产品形式
3. Developing A Novel Multidimensional Multigranularity Data Mining Approach for Discovering Association Rules [J] . Johannes K. Chiang Computer Science & Information Technology . 2012,第5期

机译：开发一种新的多维多粒度数据挖掘方法来发现关联规则
4. Developing Novel And Effective Approach For Association Rule Mining Using Progressive Sampling [C] . V. Umarani, M. Punithavalli International Conference on Computer and Electrical Engineering . 2009

机译：利用渐进采样制定关联规则挖掘的新颖有效方法
5. Learning dispatching rules via an association rule mining approach [D] . Kim, Dongwook. 2015

机译：通过关联规则挖掘方法学习调度规则
6. Boosting association rule mining in large datasets via Gibbs sampling [O] . Guoqi Qian, Calyampudi Radhakrishna Rao, Xiaoying Sun, 2016

机译：通过Gibbs采样促进大型数据集中的关联规则挖掘
7. A Novel Progressive Sampling based Approach for Effective Mining of Association Rules [O] . V. Umarani, Coimbatore India 2011

机译：基于新的基于递进采样的关联规则有效挖掘方法

Developing Novel and Effective Approach for Association Rule Mining Using Progressive Sampling

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅