首页> 外文期刊>The Journal of Systems and Software >A sliding window based algorithm for frequent closed itemset mining over data streams
【24h】

A sliding window based algorithm for frequent closed itemset mining over data streams

机译:基于滑动窗口的频繁闭合项目集数据流挖掘算法

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent pattern mining over data streams is an important problem in the context of data mining and knowledge discovery. Mining frequent closed itemsets within sliding window instead of complete set of frequent itemset is very interesting since it needs a limited amount of memory and processing power. Moreover, handling concept change within a compact set of closed patterns is faster. However, it requires flexible and efficient data structures as well as intuitive algorithms. In this paper, we have introduced an effective and efficient algorithm for closed frequent itemset mining over data streams operating in the sliding window model. This algorithm uses a novel data structure for storing transactions of the window and corresponding frequent closed itemsets. Moreover, the support of a new frequent closed itemset is efficiently computed and an old pattern is removed from the monitoring set when it is no longer frequent closed itemset. Extensive experiments on both real and synthetic data streams show that the proposed algorithm is superior to previously devised algorithms in terms of runtime and memory usage.
机译:在数据挖掘和知识发现的背景下,对数据流的频繁模式挖掘是一个重要的问题。在滑动窗口中挖掘频繁关闭的项目集而不是完整的频繁项目集非常有趣,因为它需要有限的内存和处理能力。而且,在紧凑的一组封闭模式中处理概念更改更快。但是,它需要灵活高效的数据结构以及直观的算法。在本文中,我们介绍了一种有效且高效的算法,用于对在滑动窗口模型中运行的数据流进行频繁的频繁项集挖掘。该算法使用一种新颖的数据结构来存储窗口交易和相应的频繁关闭项目集。此外,可以有效地计算新的频繁关闭项目集的支持,并在不再使用频繁关闭项目集时从监视集中删除旧模式。在真实和合成数据流上的大量实验表明,在运行时和内存使用方面,所提出的算法优于先前设计的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号