A Mining Maximal Frequent Itemsets over the Entire History of Data Streams

Abstract

Mining maximal frequent itemsets has received wide attention. However, mining data streams is more difficult than mining static databases because streaming data is huge, high-speed, and continuous. This paper presents an algorithm called IDSM-MFI. The algorithm uses a synopsis data structure to store the items of the transactions embedded in the data stream so far. It adopts a top-bottom and bottom-top method to mine the set of all maximal frequent itemsets in landmark windows over the data stream, and the results can be output in real time based on user-specified thresholds. Theoretical analysis and experimental results show that our algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of a data stream.
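
To make the terms in the abstract concrete, the following is a minimal sketch of what "maximal frequent itemsets in a landmark window" means. It is not the IDSM-MFI algorithm, whose synopsis structure and search strategy are not described in this record. It assumes a naive synopsis (a counter of distinct transactions) and a plain level-wise search; the class and method names are hypothetical.

```python
from collections import Counter
from itertools import combinations


class LandmarkWindow:
    """Naive stand-in for a synopsis: counts every distinct transaction
    seen since the landmark (hypothetical, not the IDSM-MFI structure)."""

    def __init__(self):
        self.transactions = Counter()  # frozenset(items) -> occurrences
        self.total = 0                 # transactions seen so far

    def add(self, items):
        self.transactions[frozenset(items)] += 1
        self.total += 1

    def support(self, itemset):
        # Number of stored transactions that contain every item of `itemset`.
        return sum(n for t, n in self.transactions.items() if itemset <= t)

    def maximal_frequent_itemsets(self, min_support):
        """Itemsets with relative support >= min_support that have
        no frequent proper superset."""
        if self.total == 0:
            return set()
        min_count = min_support * self.total
        items = {i for t in self.transactions for i in t}
        # Level-wise search: frequent k-itemsets seed the (k+1)-candidates.
        level = {frozenset([i]) for i in items
                 if self.support(frozenset([i])) >= min_count}
        frequent = set()
        while level:
            frequent |= level
            candidates = {a | b for a, b in combinations(level, 2)
                          if len(a | b) == len(a) + 1}
            level = {c for c in candidates if self.support(c) >= min_count}
        # Maximal = frequent with no frequent proper superset.
        return {s for s in frequent if not any(s < other for other in frequent)}


window = LandmarkWindow()
for transaction in [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "d"}]:
    window.add(transaction)

result = window.maximal_frequent_itemsets(0.5)
print(sorted(sorted(s) for s in result))  # [['a', 'b'], ['a', 'c']]
```

Because the threshold is supplied at query time, the same window can answer different user-specified thresholds; with 0.75 the sketch returns only the single items a and b. A practical stream miner would replace the exhaustive support counting with a compact synopsis.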

Bibliographic details

Source
International Workshop on Database Technology and Applications | 2009 | 5 pages
Original format: PDF
Chinese Library Classification: TP311.13-53
Keywords
data mining; data structures; IDSM-MFI algorithm; bottom-top method; data streams mining; landmark windows; maximal frequent itemset mining; synopsis data structure; top-bottom method; data streams; maximal frequent itemsets

Similar literature

1. Efficient Subset-Lattice Algorithms for Mining Closed Frequent Itemsets and Maximal Frequent Itemsets in Data Streams [J]. Ye-In Chang, Chia-En Li, Wei-Hau Peng. International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E. 2013, No. 2
2. Mining frequent, maximal and closed frequent itemsets over data stream - a review [J]. M. Jeya Sutha, F. Ramesh Dhanaseelan. International Journal of Data Analysis Techniques and Strategies. 2017, No. 1
3. Maximal and closed frequent itemsets mining from uncertain database and data stream [J]. International Journal of Data Science. 2019, No. 3
4. A Mining Maximal Frequent Itemsets over the Entire History of Data Streams [C]. International Workshop on Database Technology and Applications. 2009
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining [D]. Hao, Boyu. 2010
6. SiBIC: A Web Server for Generating Gene Set Networks Based on Biclusters Obtained by Maximal Frequent Itemset Mining [O]. Kei-ichiro Takahashi, Ichigaku Takigawa, Hiroshi Mamitsuka
7. Online Mining (Recently) Maximal Frequent Itemsets over Data Streams [O]. Hua-fu Li, Suh-yin Lee, Man-kwan Shan. 2014
