首页> 外文期刊>International Journal on Computer Science and Engineering >Association Rule Mining for Both Frequent and Infrequent Items Using Particle Swarm Optimization Algorithm
【24h】

Association Rule Mining for Both Frequent and Infrequent Items Using Particle Swarm Optimization Algorithm

机译:基于粒子群算法的频繁项与不频繁项关联规则挖掘

获取原文
           

摘要

In data mining research, generating frequent items from large databases is one of the important issues and the key factor for implementing association rule mining tasks. Mining infrequent items such as relationships among rare but expensive products is another demanding issue which have been shownin some recent studies. Therefore this study considers user assigned threshold values as a constraint which helps users mine those rules which are more interesting for them. In addition, in real world users may prefer to know relationships among frequent items along with infrequent ones. The particle swarm optimization algorithm is an important heuristic technique in recent years and this study uses this technique to mine association rules effectively. If this technique considers user defined threshold values, interesting association rules can be generated more efficiently. Therefore this study proposes a novel approach which includes using particle swarm optimization algorithm to mine association rules from databases. Our implementation of the search strategy includes bitmap representation of nodes in a lexicographic tree and from superset-subset relationship of the nodes it classifies frequent items along with infrequent itemsets. In addition, this approach avoids extra calculation overhead for generating frequent pattern trees and handling large memory which store the support values of candidate itemets. Our experimental results show that this approach efficiently mines association rules. It accesses a database to calculate a support value for fewer numbers of nodes to find frequent itemsets and from that it generates association rules, which dramatically reduces search time. The main aim of this proposed algorithm is to show how heuristic method works on real databases to find all the interesting association rules in an efficient way.
机译:在数据挖掘研究中,从大型数据库中生成频繁项是实现关联规则挖掘任务的重要问题和关键因素之一。最近一些研究表明,挖掘稀有物品(例如稀有但昂贵的产品之间的关系)是另一个要求苛刻的问题。因此,本研究将用户分配的阈值视为约束,这有助于用户挖掘对他们而言更有趣的规则。另外,在现实世界中,用户可能更喜欢了解频繁项目与不频繁项目之间的关系。粒子群优化算法是近年来重要的启发式技术,本研究利用该技术有效地挖掘了关联规则。如果此技术考虑了用户定义的阈值,则可以更有效地生成有趣的关联规则。因此,本研究提出了一种新颖的方法,该方法包括使用粒子群优化算法从数据库中挖掘关联规则。我们对搜索策略的实现包括词典词典树中节点的位图表示,并根据节点的超集-子集关系对频繁项和不频繁项集进行分类。另外,此方法避免了用于生成频繁的模式树和处理存储候选项目集支持值的大内存的额外计算开销。我们的实验结果表明,该方法有效地挖掘了关联规则。它访问数据库以计算较少数量节点的支持值以找到频繁的项目集,并由此生成关联规则,从而大大减少了搜索时间。该算法的主要目的是说明启发式方法如何在实际数据库上工作,从而以有效的方式找到所有有趣的关联规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号