Journal: Expert Systems with Applications

Probabilistic frequent itemset mining over uncertain data streams


Abstract

This paper considers the problem of mining probabilistic frequent itemsets in the sliding window of an uncertain data stream. We design an effective in-memory index named PFIT to store the data synopsis, so that the current probabilistic frequent itemsets can be output in real time. We also propose a depth-first algorithm, PFIMoS, that builds and maintains the PFIT dynamically in a bottom-up manner. Because computing the probabilistic support is time-consuming, we propose a method to estimate the range of the probabilistic support from the support and the expected support, which can greatly reduce runtime and memory usage. Nevertheless, a massive number of probabilistic supports still have to be computed when the minimum support is low and the data are dense, which may drastically reduce computing speed. We further address this problem with a heuristic rule-based algorithm, PFIMoS+, in which an error parameter is introduced to reduce the number of probabilistic support computations. Theoretical analysis and experimental studies demonstrate that the proposed algorithms reduce computing time and memory usage, ensure fast and exact mining of probabilistic data streams, and markedly outperform the state-of-the-art algorithms TODIS-Stream (Sun et al., 2010) and FEMP (Akbarinia & Masseglia, 2013).
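To make the three quantities named in the abstract concrete, the following is a minimal Python sketch, not the paper's PFIT or PFIMoS code. It assumes each transaction in the window contains the itemset independently with a known probability, and it assumes the common definition of probabilistic support as the largest i such that P(sup >= i) >= tau; the abstract does not spell out either point. All function names and the threshold name `tau` are hypothetical. The sketch shows why exact computation is costly (a quadratic dynamic program) and illustrates only the trivial bound that the probabilistic support never exceeds the plain support count; the paper's tighter range estimate is not reproduced here.

```python
from typing import List


def expected_support(probs: List[float]) -> float:
    """Sum of the per-transaction existence probabilities."""
    return sum(probs)


def support_count(probs: List[float]) -> int:
    """Number of transactions that may contain the itemset (probability > 0)."""
    return sum(1 for p in probs if p > 0)


def support_distribution(probs: List[float]) -> List[float]:
    """dist[k] = P(exactly k transactions contain the itemset).

    Assumes independent transactions. Runs in O(n^2) time, which is the
    expensive step that a cheap range estimate would try to avoid.
    """
    dist = [1.0]
    for p in probs:
        nxt = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            nxt[k] += mass * (1.0 - p)   # transaction does not contain it
            nxt[k + 1] += mass * p       # transaction contains it
        dist = nxt
    return dist


def probabilistic_support(probs: List[float], tau: float) -> int:
    """Largest i with P(sup >= i) >= tau (assumed definition)."""
    dist = support_distribution(probs)
    tail = 0.0
    for i in range(len(dist) - 1, 0, -1):
        tail += dist[i]                  # tail is now P(sup >= i)
        if tail >= tau:
            return i
    return 0


if __name__ == "__main__":
    window = [0.9, 0.8, 0.5, 0.0]        # hypothetical probabilities
    print(expected_support(window))
    print(support_count(window))
    print(probabilistic_support(window, tau=0.7))
```

Under these assumptions, an itemset whose support count is already below the minimum support can be rejected without running the dynamic program at all, which is the kind of saving the abstract attributes to bounding the probabilistic support before computing it.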
