An Algorithm for Mining Approximate Frequent Itemsets Over Data Streams

机译：一种在数据流上挖掘近似频繁项集的算法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

It is much more difficult to mining frequent itemsets over data streams than traditional data model because data stream has the following characters: unbounded volume of data,rapid arriving rate of records,uncontrollability of records' arriving order,etc. A novel algorithm is devised based on Lossy Counting to mine frequent itemsets. Logarithmic tilted time window with an attenuation coefficient is adopted to emphasize the importance of new data. Multilayer count queue mode is designed to not only avoid the counter overflowing but also query top-K itemsets quickly using a index table.

机译：与传统的数据模型相比，在数据流上挖掘频繁的项目集要困难得多，因为数据流具有以下特征：无限制的数据量，快速的记录到达率，记录到达顺序的不可控性等。设计了一种基于有损计数的新算法来挖掘频繁项集。采用具有衰减系数的对数倾斜时间窗口来强调新数据的重要性。多层计数队列模式设计为不仅可以避免计数器溢出，还可以使用索引表快速查询前K个项目集。

著录项

来源
《International conference on opto-electronics engineering and information science》|2011年|1444-1447|共4页
会议地点
作者
Na Su; Zhehui Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类光电子技术、激光技术;
关键词
data stream; frequent itemsets; logarithmic tilted time window;

机译：数据流;频繁项集;对数倾斜时间窗;

相似文献

外文文献
中文文献
专利

1. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
2. SWEclat: a frequent itemset mining algorithm over streaming data using Spark Streaming [J] . Xiao Wen, Hu Juan Journal of supercomputing . 2020,第10期

机译：SWECLAT：使用Spark流式传输数据的频繁项目集挖掘算法
3. Approximate mining of global closed frequent itemsets over data streams [J] . Lichao Guo, Hongye Su, Yu Qu Journal of the Franklin Institute . 2011,第6期

机译：通过数据流近似挖掘全局封闭频繁项集
4. An Algorithm for Mining Approximate Frequent Itemsets Over Data Streams [C] . Na Su, Zhehui Wu International conference on opto-electronics engineering and information science . 2011

机译：挖掘数据流近似频繁项目集的算法
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. Genetic Programming and Frequent Itemset Mining to Identify Feature Selection Patterns of iEEG and fMRI Epilepsy Data [O] . Otis Smart, Lauren Burrell -1

机译：遗传程序设计和频繁项集挖掘以识别iEEG和fMRI癫痫数据的特征选择模式
7. An Approximate Approach for Mining Recently Frequent Itemsets from Data Streams* [O] . Jia-ling Koh, Shu-ning Shin 2015

机译：从数据流中挖掘最近频繁项集的近似方法*

An Algorithm for Mining Approximate Frequent Itemsets Over Data Streams

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅