首页> 外文期刊>Expert systems with applications >PrePost(+): An efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning
【24h】

PrePost(+): An efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning

机译:PrePost(+):一种有效的基于N列表的算法,用于通过“儿童-父母对等”修剪挖掘频繁项集

获取原文
获取原文并翻译 | 示例

摘要

N-list is a novel data structure proposed in recent years. It has been proven to be very efficient for mining frequent itemsets. In this paper, we present PrePost(+), a high-performance algorithm for mining frequent itemsets. It employs N-list to represent itemsets and directly discovers frequent itemsets using a set-enumeration search tree. Especially, it employs an efficient pruning strategy named Children-Parent Equivalence pruning to greatly reduce the search space. We have conducted extensive experiments to evaluate PrePost(+) against three state-of-the-art algorithms, which are PrePost, FIN, and FP-growth*, on six various real datasets. The experimental results show that PrePost(+) is always the fastest one on all datasets. Moreover, PrePost(+) also demonstrates good performance in terms of memory consumption since it use only a litter more memory than FP-growth* and less memory than PrePost and FIN. (C) 2015 Elsevier Ltd. All rights reserved.
机译:N-list是近年来提出的一种新颖的数据结构。它已被证明对于挖掘频繁项集非常有效。在本文中,我们提出了PrePost(+),这是一种用于挖掘频繁项集的高性能算法。它使用N-list表示项目集,并使用集合枚举搜索树直接发现频繁的项目集。特别是,它采用了一种有效的修剪策略,称为“儿童-父母对等修剪”,以大大减少搜索空间。我们已经进行了广泛的实验,针对六个不同的真实数据集,针对三种最新算法,即PrePost,FIN和FP-growth *,对PrePost(+)进行了评估。实验结果表明,PrePost(+)始终是所有数据集中最快的一种。此外,PrePost(+)在内存消耗方面也表现出良好的性能,因为它只使用比FP-growth *多的内存,而比PrePost和FIN少的内存。 (C)2015 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号