首页> 外文会议>International conference on neural information processing;ICONIP 2009 >FIA: Frequent Itemsets Mining Based on Approximate Counting in Data Streams
【24h】

FIA: Frequent Itemsets Mining Based on Approximate Counting in Data Streams

机译:FIA:基于数据流中近似计数的频繁项集挖掘

获取原文

摘要

In this paper, we consider the problem of frequent elements over data stream seeks the set of items whose frequency exceeds σN for a given threshold parameter σ. We refer to this model as the sliding window model. We also use a user specified error parameter, e, to control the accuracy of the mining result. We also propose an FIA (Frequent Itemsets mining based on an Approximate counting) algorithm based on the Chernoff bound with a guarantee of the output quality and also a bound on the memory usage. The proposed algorithm show that runs significantly faster and consumes less memory than do existing algorithms for mining approximate frequent itemsets.
机译:在本文中,我们考虑了数据流中频繁元素的问题,即对于给定的阈值参数σ,其频率超过σN的项的集合。我们将此模型称为滑动窗口模型。我们还使用用户指定的错误参数e来控制挖掘结果的准确性。我们还提出了一种基于Chernoff边界的FIA(基于近似计数的频繁项集挖掘)算法,同时保证了输出质量和内存使用量的边界。与挖掘近似频繁项集的现有算法相比,所提出的算法显示出运行速度更快且消耗的内存更少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号