...
首页> 外文期刊>Knowledge and information systems >EFIM: a fast and memory efficient algorithm for high-utility itemset mining
【24h】

EFIM: a fast and memory efficient algorithm for high-utility itemset mining

机译:EFIM:一种快速和记忆高效算法的高实用程序项集挖掘

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, high-utility itemset mining has emerged as an important data mining task. However, it remains computationally expensive both in terms of runtime and memory consumption. It is thus an important challenge to design more efficient algorithms for this task. In this paper, we address this issue by proposing a novel algorithm named EFIM (EFficient high-utility Itemset Mining), which introduces several new ideas to more efficiently discover high-utility itemsets. EFIM relies on two new upper bounds named revised sub-tree utility and local utility to more effectively prune the search space. It also introduces a novel array-based utility counting technique named Fast Utility Counting to calculate these upper bounds in linear time and space. Moreover, to reduce the cost of database scans, EFIM proposes efficient database projection and transaction merging techniques named High-utility Database Projection and High-utility Transaction Merging (HTM), also performed in linear time. An extensive experimental study on various datasets shows that EFIM is in general two to three orders of magnitude faster than the state-of-art algorithms HUP, HUI-Miner, HUP-Miner, FHM and UP-Growth+ on dense datasets and performs quite well on sparse datasets. Moreover, a key advantage of EFIM is its low memory consumption.
机译:近年来,高实用项目集挖掘已成为一个重要的数据挖掘任务。然而,在运行时和存储器消耗方面,它仍然昂贵。因此,为此任务设计更有效的算法是一个重要的挑战。在本文中,我们通过提出名为EFIM的新算法(高效的高实用程序挖掘)来解决这个问题,这引入了几个新的想法,以更有效地发现高实用程序项集。 efim依赖于名为修改后的子树实用程序和本地实用程序的两个新的上限,以更有效地修剪搜索空间。它还介绍了一种基于阵列的实用程序计数技术,名为Fast Utility Counting,以计算线性时间和空间中的这些上限。此外,为了降低数据库扫描的成本,EFIM提出了具有名为高实用程序数据库投影和高实用程序交易合并(HTM)的高效数据库投影和事务合并技术,也在线性时间中执行。关于各种数据集的广泛实验研究表明,eFIM通常比最先进的算法,惠矿工,HUP-MILER,FHM和上升+在密集的数据集中快速增长,并且表现得非常好在稀疏数据集上。此外,EFIM的主要优点是其低存储器消耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号