首页> 外文会议>International Workshop on Database Technology and Applications >A Mining Maximal Frequent Itemsets over the Entire History of Data Streams
【24h】

A Mining Maximal Frequent Itemsets over the Entire History of Data Streams

机译:在整个数据流历史上挖掘最大频繁项目集

获取原文

摘要

Mining maximal frequent itemsets has been widely concerned. However, mining data streams is more difficult than mining static databases because of the huge, high-speed and continuous characteristics of streaming data. This paper presents an algorithm, called IDSM-MFI. The algorithm uses a synopsis data structure to store the items of transactions embedded data streams so far. It adopts a top-bottom and bottom-top method to mine the set of all maximal frequent itemsets in landmark windows over data stream, which can be output in real time based on users' specified thresholds. Theoretical analysis and experimental results show that our algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of data stream.
机译:采矿最大频繁项目集已被广泛关注。然而,由于流数据的巨大,高速和连续特性,挖掘数据流比挖掘静态数据库更困难。本文提出了一种称为IDSM-MFI的算法。该算法使用概要数据结构来存储嵌入数据流的事务项目到目前为止。它采用顶部底部和最顶层的方法来挖掘地标窗口中的所有最大频繁项集的数据流,可以基于用户指定的阈值实时输出。理论分析和实验结果表明,我们的算法在整个数据流历史记录中挖掘所有最大频繁项集的集合是有效和可扩展的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号