...
首页> 外文期刊>International journal of machine learning and cybernetics >Mining frequent itemsets using the N-list and subsume concepts
【24h】

Mining frequent itemsets using the N-list and subsume concepts

机译:使用N-list和subsume概念挖掘频繁项集

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Frequent itemset mining is a fundamental element with respect to many data mining problems directed at finding interesting patterns in data. Recently the PrePost algorithm, a new algorithm for mining frequent itemsets based on the idea of N-lists, which in most cases outperforms other current state-of-the-art algorithms, has been presented. This paper proposes an improved version of PrePost, the N-list and Subsume-based algorithm for mining Frequent Itemsets (NSFI) algorithm that uses a hash table to enhance the process of creating the N-lists associated with 1-itemsets and an improved N-list intersection algorithm. Furthermore, two new theorems are proposed for determining the "subsume index" of frequent 1-item-sets based on the N-list concept. Using the subsume index, NSFI can identify groups of frequent itemsets without determining the N-list associated with them. The experimental results show that NSFI outperforms PrePost in terms of runtime and memory usage and outperforms dE-clat in terms of runtime.
机译:频繁的项目集挖掘是针对许多数据挖掘问题的基本元素,这些问题旨在发现数据中有趣的模式。最近,已经提出了PrePost算法,这是一种基于N-list的思想来挖掘频繁项集的新算法,在大多数情况下,PrePost算法的性能优于其他最新技术。本文提出了改进版的PrePost,一种用于挖掘频繁项集的N列表和基于主体的算法(NSFI)算法,该算法使用哈希表来增强创建与1项集相关的N列表的过程,并改进了N -list交集算法。此外,提出了两个新的定理,用于基于N-list概念确定频繁的1个项目集的“包含指数”。使用用户索引,NSFI可以识别频繁项目集的组,而无需确定与它们相关联的N-列表。实验结果表明,NSFI在运行时和内存使用方面优于PrePost,在运行时方面优于dE-clat。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号