Mining frequent itemsets using the N-list and subsume concepts

Bay Vo; Tuong Le; Frans Coenen; Tzung-Pei Hong

首页> 外文期刊>International journal of machine learning and cybernetics >Mining frequent itemsets using the N-list and subsume concepts

【24h】

Mining frequent itemsets using the N-list and subsume concepts

机译：使用N-list和subsume概念挖掘频繁项集

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Frequent itemset mining is a fundamental element with respect to many data mining problems directed at finding interesting patterns in data. Recently the PrePost algorithm, a new algorithm for mining frequent itemsets based on the idea of N-lists, which in most cases outperforms other current state-of-the-art algorithms, has been presented. This paper proposes an improved version of PrePost, the N-list and Subsume-based algorithm for mining Frequent Itemsets (NSFI) algorithm that uses a hash table to enhance the process of creating the N-lists associated with 1-itemsets and an improved N-list intersection algorithm. Furthermore, two new theorems are proposed for determining the "subsume index" of frequent 1-item-sets based on the N-list concept. Using the subsume index, NSFI can identify groups of frequent itemsets without determining the N-list associated with them. The experimental results show that NSFI outperforms PrePost in terms of runtime and memory usage and outperforms dE-clat in terms of runtime.

机译：频繁的项目集挖掘是针对许多数据挖掘问题的基本元素，这些问题旨在发现数据中有趣的模式。最近，已经提出了PrePost算法，这是一种基于N-list的思想来挖掘频繁项集的新算法，在大多数情况下，PrePost算法的性能优于其他最新技术。本文提出了改进版的PrePost，一种用于挖掘频繁项集的N列表和基于主体的算法（NSFI）算法，该算法使用哈希表来增强创建与1项集相关的N列表的过程，并改进了N -list交集算法。此外，提出了两个新的定理，用于基于N-list概念确定频繁的1个项目集的“包含指数”。使用用户索引，NSFI可以识别频繁项目集的组，而无需确定与它们相关联的N-列表。实验结果表明，NSFI在运行时和内存使用方面优于PrePost，在运行时方面优于dE-clat。

著录项

来源
《International journal of machine learning and cybernetics》 |2016年第2期|253-265|共13页
作者
Bay Vo; Tuong Le; Frans Coenen; Tzung-Pei Hong;
展开▼
作者单位

Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam;

Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam;

Department of Computer Science, University of Liverpool, Liverpool, UK;

Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Data mining; Pattern mining; Frequent itemset; N-list; Subsume;

机译：数据挖掘;模式挖掘;频繁项集;N列表;归入;

相似文献

外文文献
中文文献
专利

1. A haoop-based parallel mining of frequent itemsets using N-Lists [J] . Sohrabi Mohammad Karim, Taheri Narjes Journal of the Chinese Institute of Engineers . 2018,第3期

机译：使用n-list的基于统一项目集的平行挖掘
2. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
3. A high utility itemset mining algorithm based on subsume index [J] . Song Wei, Zhang Zihan, Li Jinhong Knowledge and information systems . 2016,第1期

机译：基于包含索引的高效项集挖掘算法
4. Mining Frequent Weighted Itemsets Using Extended N-List and Subsume [C] . Yue Zhai, Jianmei Sun, Ruijie Liu International Conference on Robots Intelligent System . 2019

机译：使用扩展的N-List和Subsume挖掘频繁加权的项目集
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. Unravelling associations between unassigned mass spectrometry peaks with frequent itemset mining techniques [O] . Trung Nghia Vu, Aida Mrzic, Dirk Valkenborg, 2014

机译：利用频繁项集挖掘技术揭示未分配质谱峰之间的关联
7. Mining Frequent Itemsets Using The N-list and Subsume Concepts [O] . Vo B, Le T, Coenen F, 2014

机译：使用N列表和包含概念挖掘频繁项集

Mining frequent itemsets using the N-list and subsume concepts

摘要

著录项

相似文献

相关主题

期刊订阅