...
【24h】

TKEH: an efficient algorithm for mining top-k high utility itemsets

机译:TKEH:挖掘Top-K高实用项目集的高效算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

High utility itemsets mining is a subfield of data mining with wide applications. Although the existing high utility itemsets mining algorithms can discover all the itemsets satisfying a given minimum utility threshold, it is often difficult for users to set a proper minimum utility threshold. A smaller minimum utility threshold value may produce a huge number of itemsets, whereas a higher one may produce a few itemsets. Specification of minimum utility threshold is difficult and time-consuming. To address these issues, top-k high utility itemsets mining has been defined where k is the number of high utility itemsets to be found. In this paper, we present an efficient algorithm (named TKEH) for finding top-k high utility itemsets. TKEH utilizes transaction merging and dataset projection techniques to reduce the dataset scanning cost. These techniques reduce the dataset when larger items are explored. TKEH employs three minimum utility threshold raising strategies. We utilize two strategies to prune search space efficiently. To calculate the utility of items and upper-bounds in linear time, TKEH utilizes array-based utility technique. We carried out some extensive experiments on real datasets. The results show that TKEH outperforms the state-of-the-art algorithms. Moreover, TKEH always performs better for dense datasets.
机译:高实用程序项集采矿是具有广泛应用的数据挖掘子场。虽然现有的高实用程序集合挖掘算法可以发现满足给定的最低实用程序阈值的所有项目集,但用户通常很难设置正确的最小实用程序阈值。较小的最低实用程序阈值可能会产生大量的项目集,而较高的则可能产生一些项目集。最小公用事业阈值的规范是困难且耗时的。为了解决这些问题,已经定义了Top-K高实用程序项集挖掘,其中K是要找到的高实用程序项的数量。在本文中,我们介绍了一个有效的算法(命名为TKEH),用于查找顶级K高实用程序项集。 TKEH利用交易合并和数据集投影技术来降低数据集扫描成本。这些技术在探索较大的物品时减少数据集。 TKEH采用三个最低实用阈值培养策略。我们有效地利用了两种策略来修剪搜索空间。要计算项目的实用性和线性时间中的上限,TKEh利用基于阵列的实用技术。我们对真实数据集进行了一些广泛的实验。结果表明,TKEH优于最先进的算法。此外,TKEH始终对密集数据集进行更好的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号