EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS

Ye-In Chang; Chia-En Li; Wei-Hau Peng; Syuan-Yun Wang

首页> 外文期刊>International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E >EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS

【24h】

EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

There are multiple applications for using association rules in data streams, such as market analysis, sensor networks and web tracking. Because data streams are continuous, high speed, unbounded and in real time, we can only scan once for data streams. Mining closed frequent itemsets is a further work of mining association rules, where a closed frequent itemset is a frequent itemset which has no superset with the same support. One well-known algorithm for mining closed frequent itemsets based on the sliding window model is the NewMoment algorithm. However, the NewMoment algorithm could not efficiently mine closed frequent itemsets in data streams, since they will generate closed frequent itemsets and many unclosed frequent itemsets. Moreover, when data in the sliding window is incrementally updated, the NewMoment algorithm needs to reconstruct the whole tree structure. On the other hand, a frequent itemset is called maximal, if it is not a subset of any other frequent itemset. One well-known algorithm for mining maximal frequent itemsets based on the sliding window model is called the MFIoSSW algorithm. The MFIoSSW algorithm uses a compact structure to mine the maximal frequent itemsets. However, when the new transaction comes, the number of comparisons between the new transaction and the old transactions is too much. Therefore, in this paper, we propose the Subset-Lattice algorithm, which embed the property of subsets into the lattice structure to efficiently mine closed frequent itemsets and maximal frequent itemsets over a data stream sliding window. Moreover, when data in the sliding window is incrementally updated, our Subset-Lattice algorithms will not reconstruct the whole lattice structure. From our simulation results, we show that our algorithm for mining closed frequent itemsets outperforms the NewMoment algorithm, and our algorithm for mining maximal frequent itemsets also outperforms the MFIoSSW algorithm.

机译：有多种应用程序在数据流中使用关联规则，例如市场分析，传感器网络和Web跟踪。因为数据流是连续的，高速的，无限制的并且是实时的，所以我们只能扫描一次数据流。挖掘封闭的频繁项集是挖掘关联规则的进一步工作，其中封闭的频繁项集是在相同支持下没有超集的频繁项集。一种基于滑动窗口模型的频繁闭合项目集挖掘算法是NewMoment算法。但是，NewMoment算法无法有效地挖掘数据流中的封闭频繁项目集，因为它们将生成封闭频繁项目集和许多未封闭频繁项目集。此外，当滑动窗口中的数据被增量更新时，NewMoment算法需要重建整个树结构。另一方面，如果频繁项集不是任何其他频繁项集的子集，则称为最大项集。一种基于滑动窗口模型的挖掘最大频繁项集的著名算法称为MFIoSSW算法。 MFIoSSW算法使用紧凑的结构来挖掘最大频繁项集。但是，当新事务到来时，新事务和旧事务之间的比较次数太多了。因此，在本文中，我们提出了Subset-Lattice算法，该算法将子集的属性嵌入晶格结构中，以有效地挖掘数据流滑动窗口上的封闭频繁项集和最大频繁项集。此外，当滑动窗口中的数据被增量更新时，我们的子集格算法将不会重建整个晶格结构。从仿真结果可以看出，我们的封闭频繁项目集挖掘算法优于NewMoment算法，而最大频繁项目集挖掘算法也优于MFIoSSW算法。

著录项

来源
《International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E》 |2013年第2期|共13页
作者
Ye-In Chang; Chia-En Li; Wei-Hau Peng; Syuan-Yun Wang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电机;
关键词
Closed frequent itemsets; Data streams; Lattice structure; Maximal frequent itemsets; Sliding window;

机译：封闭的频繁项集;数据流;格结构;最大频繁项集;滑动窗口;

相似文献

外文文献
中文文献
专利

1. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
2. Maximal and closed frequent itemsets mining from uncertain database and data stream [J] . International journal of data science . 2019,第3期

机译：从不确定的数据库和数据流挖掘最大和闭合频繁的项目集
3. An efficient algorithm for mining frequent maximal and closed itemsets [J] . Tarek F. Gharib International Journal of Hybrid Intelligent Systems . 2009,第3期

机译：一种高效的频繁最大封闭项目集挖掘算法
4. An Efficient Subset-Lattice Algorithm for Mining Closed Frequent Itemsets in Data Streams [C] . Chang Ye-In, Li Chia-En, Peng Wei-Hau 2012 Conference on Technologies and Applications of Artificial Intelligence. . 2012

机译：一种高效的子集格算法，用于挖掘数据流中的封闭频繁项集
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data [O] . András Király, Attila Gyenesei, János Abonyi -1

机译：高位二进制数据中基于位表的聚类和频繁封闭项集挖掘
7. An Algorithm for Mining Frequent Closed Itemsets in Data Stream [O] . Dai Caiyan, Chen Ling 2012

机译：数据流中频繁关闭项目集的挖掘算法

EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS

摘要

著录项

相似文献

相关主题

期刊订阅