Proceedings of the 2007 International Conference on Machine Learning and Cybernetics

A SCALABLE ASSOCIATION RULES MINING ALGORITHM BASED ON SORTING, INDEXING AND TRIMMING


Abstract

Apriori is an influential and well-known algorithm for mining association rules. However, its main drawback is the large number of candidate itemsets it generates. Several hash-based algorithms, such as DHP and MPIP, were proposed to deal with this problem. DHP employs hash functions to filter out unpromising candidate itemsets. MPIP further improves on DHP by employing minimal perfect hashing functions to avoid generating candidate itemsets. Though MPIP achieves very promising mining efficiency, the memory space it requires increases rapidly as the number of items grows. To obtain even better mining efficiency while reducing the memory space required, this paper proposes a Sorting-Indexing-Trimming (SIT) algorithm for mining association rules. SIT uses sorting, indexing, and trimming techniques to reduce the number of itemsets to be considered. Then, to combine the advantages of Apriori and MPIP, a revised MPIP algorithm is employed to handle 2-itemsets, and a revised Apriori algorithm to handle k-itemsets for k > 2. According to the experimental results, SIT outperforms both Apriori and MPIP even though it requires less memory than MPIP.
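The abstract does not give the details of SIT, so the following Python sketch is only an illustration of the general pipeline it describes, not the authors' algorithm. It assumes that "sorting" means ordering items by support, "indexing" means mapping the surviving items to dense integers, and "trimming" means dropping infrequent items from each transaction. A triangular array with one slot per item pair stands in for the minimal perfect hash used for 2-itemsets, and a plain Apriori join-and-prune loop handles k > 2. The names `mine_frequent_itemsets` and `pair_slot` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def mine_frequent_itemsets(transactions, min_support):
    """Return {tuple_of_items: support_count} for every frequent itemset."""
    # Sorting: count each item once per transaction, order survivors by support.
    item_counts = Counter(item for t in transactions for item in set(t))
    frequent_items = sorted(
        (i for i, c in item_counts.items() if c >= min_support),
        key=lambda i: (item_counts[i], i),
    )
    # Indexing: map each frequent item to a dense integer 0..n-1.
    index = {item: pos for pos, item in enumerate(frequent_items)}
    n = len(frequent_items)
    # Trimming: drop infrequent items, then drop transactions too short to matter.
    trimmed = [sorted(index[i] for i in set(t) if i in index) for t in transactions]
    trimmed = [t for t in trimmed if len(t) >= 2]

    result = {(i,): item_counts[i] for i in frequent_items}

    # 2-itemsets: every pair (a, b) with a < b owns exactly one array slot,
    # so pairs are counted directly without generating candidates.
    def pair_slot(a, b):
        return a * n - a * (a + 1) // 2 + (b - a - 1)

    pair_counts = [0] * (n * (n - 1) // 2)
    for t in trimmed:
        for a, b in combinations(t, 2):
            pair_counts[pair_slot(a, b)] += 1
    level = {
        (a, b): pair_counts[pair_slot(a, b)]
        for a in range(n)
        for b in range(a + 1, n)
        if pair_counts[pair_slot(a, b)] >= min_support
    }

    # k > 2: classic Apriori join, subset pruning, and support counting.
    k = 3
    while level:
        result.update(
            {tuple(frequent_items[i] for i in s): c for s, c in level.items()}
        )
        candidates = set()
        for x, y in combinations(sorted(level), 2):
            if x[:-1] == y[:-1]:  # join itemsets sharing a (k-2)-prefix
                cand = x + (y[-1],)
                if all(sub in level for sub in combinations(cand, k - 1)):
                    candidates.add(cand)
        counts = Counter()
        for t in trimmed:
            if len(t) >= k:
                tset = set(t)
                for cand in candidates:
                    if tset.issuperset(cand):
                        counts[cand] += 1
        level = {c: s for c, s in counts.items() if s >= min_support}
        k += 1
    return result
```

The triangular array needs n(n-1)/2 counters for n frequent items, which is why trimming infrequent items before the 2-itemset pass matters for memory in this sketch. Whether the paper's revised MPIP step uses the same layout is not stated in the abstract.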
