首页> 外文会议>International conference on opto-electronics engineering and information science >An Algorithm for Mining Approximate Frequent Itemsets Over Data Streams
【24h】

An Algorithm for Mining Approximate Frequent Itemsets Over Data Streams

机译:一种在数据流上挖掘近似频繁项集的算法

获取原文

摘要

It is much more difficult to mining frequent itemsets over data streams than traditional data model because data stream has the following characters: unbounded volume of data,rapid arriving rate of records,uncontrollability of records' arriving order,etc. A novel algorithm is devised based on Lossy Counting to mine frequent itemsets. Logarithmic tilted time window with an attenuation coefficient is adopted to emphasize the importance of new data. Multilayer count queue mode is designed to not only avoid the counter overflowing but also query top-K itemsets quickly using a index table.
机译:与传统的数据模型相比,在数据流上挖掘频繁的项目集要困难得多,因为数据流具有以下特征:无限制的数据量,快速的记录到达率,记录到达顺序的不可控性等。设计了一种基于有损计数的新算法来挖掘频繁项集。采用具有衰减系数的对数倾斜时间窗口来强调新数据的重要性。多层计数队列模式设计为​​不仅可以避免计数器溢出,还可以使用索引表快速查询前K个项目集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号