【24h】

ONLINE DATA STREAM MINING OF RECENT FREQUENT ITEMSETS BASED ON SLIDING WINDOW MODEL

机译:基于滑动窗口模型的最近项在线数据流挖掘

获取原文

摘要

Online data stream mining is one of the most important issues in data mining. Identifying the recent knowledge can provide valuable information for the analysis of the data stream. In this paper, we proposed an one-pass data stream mining algorithm to mine the recent frequent itemsets in data streams with a sliding window basing on transactions. To reduce the cost of time and memory needed to slide the windows, each items is denoted a bit-sequence representations. Basing on Apriori property, this kind of representations can find frequent items in data streams efficiently. We named this method MRFI-SW (Mining Recent Frequent Itemsets by Sliding Window) algorithm. Experiment results show that the proposed algorithm not only attains highly accurate mining result, but also consumes less memory than existing algorithms for mining frequent itemsets over recent data streams.
机译:在线数据流挖掘是数据挖掘中最重要的问题之一。识别最近的知识可以为数据流的分析提供有价值的信息。在本文中,我们提出了一种单次数据流挖掘算法,该算法通过基于事务的滑动窗口来挖掘数据流中最近出现的频繁项集。为了减少滑动窗口所需的时间和内存,每个项目均以位序列表示形式表示。基于Apriori属性,这种表示可以有效地找到数据流中的频繁项。我们将此方法命名为MRFI-SW(通过滑动窗口挖掘最近的频繁项集)算法。实验结果表明,与现有算法相比,该算法不仅可以获得较高的挖掘结果精度,而且消耗的内存更少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号