首页> 外文期刊>Intelligent data analysis >An efficient incremental mining algorithm-QSD
【24h】

An efficient incremental mining algorithm-QSD

机译:一种高效的增量挖掘算法-QSD

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The generation of frequent itemsets is an essential and time-consuming step in mining association rules. Most of the studies adopt the Apriori-based approach, which has great effort in generating candidate itemsets and needs multiple database accesses. Recent studies indicate that FP-tree approach has been utilized to avoid the generation of candidate itemsets and scan transaction database only twice, but they work with more complicated data structure. Besides, it needs to adjust the structure of FP-tree when it applied to incremental mining application. It is necessary to adjust the position of an item upward or downward in the structure of FP-tree when a new transaction increases or decreases the accumulation of the item. The process of the adjustment of the structure of FP-tree is the bottlenecks of the FP-tree in incremental mining application. Therefore, algorithms for efficient mining of frequent patterns are in urgent demand. This paper aims to improve both time and space efficiency in mining frequent itemsets and incremental mining application. We propose a novel QSD (Quick Simple Decomposition) algorithm using simple decompose principle which derived from minimal heap tree, we can discover the frequent itemsets quickly under one database scan. Meanwhile, QSD algorithm doesn't need to scan database and reconstruct data structure again when database is updated or minimum support is varied. It can be applied to on-line incremental mining applications without any modification. Comprehensive experiments have been conducted to assess the performance of the proposed algorithm. The experimental results show that the QSD algorithm outperforms previous algorithms.
机译:频繁项集的生成是挖掘关联规则中必不可少且耗时的步骤。大多数研究都采用基于Apriori的方法,该方法在生成候选项目集方面付出了巨大的努力,并且需要多个数据库访问权限。最近的研究表明,使用FP树方法可以避免生成候选项目集,并且仅扫描事务数据库两次,但是它们可以处理更复杂的数据结构。此外,在将FP-tree应用于增量挖掘应用时,还需要调整FP-tree的结构。当新交易增加或减少项目的累积时,有必要在FP-tree结构中向上或向下调整项目的位置。 FP树结构的调整过程是FP树在增量采矿应用中的瓶颈。因此,迫切需要用于有效挖掘频繁模式的算法。本文旨在提高频繁项集挖掘和增量挖掘应用中的时间和空间效率。我们提出了一种基于最小分解树的简单分解原理的新型QSD(Quick Simple Decomposition,快速简单分解)算法,可以在一次数据库扫描中快速发现频繁项集。同时,在更新数据库或更改最小支持时,QSD算法无需再次扫描数据库并重新构建数据结构。无需进行任何修改即可将其应用于在线增量采矿应用程序。已经进行了全面的实验,以评估所提出算法的性能。实验结果表明,QSD算法优于以前的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号