...
首页> 外文期刊>Knowledge and information systems >An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix
【24h】

An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix

机译:使用新型修剪方法在公用事业矩阵上的高效投影方法

获取原文
获取原文并翻译 | 示例

摘要

High utility itemset mining is an important extension of frequent itemset mining which considers unit profits and quantities of items as external and internal utilities, respectively. Since the utility function has not downward closure property, an overestimated value of utility is obtained using an anti-monotonic upper bound of utility function to prune the search space and improve the efficiency of high utility itemset mining methods. Transaction-weighted utilization (TWU) of itemset was the first and one of the most important functions which has been used as the anti-monotonic upper bound of utility by various algorithms. A variety of high utility itemset mining methods have attempted to tighten the utility upper bound and have exploited appropriate pruning strategies to improve mining efficiency. Although TWU and its improved alternatives have attempted to increase the efficiency of high utility itemset mining methods by pruning their search spaces, they suffer from a significant number of generated candidates which are high-TWU but are not high utility itemsets. Calculating the actual utilities of low utility candidates needs to multiple scanning of the dataset and thus imposes a huge overhead to the mining methods, which can cause to lose the pruning benefits of the upper bounds. Proposing appropriate pruning strategies, exploiting efficient data structures, and using tight anti-monotonic upper bounds can overcome this problem and lead to significant performance improvement in high utility itemset mining methods. In this paper, a new projection-based method, called MAHI (matrix-aided high utility itemset mining), is introduced which uses a novel utility matrix-based pruning strategy, called MA-prune to improve the high utility itemset mining performance in terms of execution time. The experimental results show that MAHI is faster than former algorithms.
机译:高实用程序项目集挖掘是频繁的项目集挖掘的重要扩展,其分别将单位利润和数量的项目视为外部和内部公用事业。由于实用程序函数没有向下闭合属性,因此使用实用程序功能的反单调上限获得高度归因的实用程序值,以修剪搜索空间并提高高实用程序项集挖掘方法的效率。项目集的交易加权利用(TWU)是最重要的功能和最重要的功能之一,该功能被各种算法用作效用的反单调上限。各种高效项目集采矿方法试图收紧实用性上限,并利用适当的修剪策略来提高采矿效率。虽然TWU及其改进的替代方案已经尝试通过修剪搜索空间来提高高效项目集合方法的效率,但它们遭受大量产生的候选者,这些候选者是高度TWU但不高的公用事业项目。计算低实用程序候选者的实际实用程序需要多次扫描数据集,因此对采矿方法施加了巨大的开销,这可能导致失去上限的修剪益处。提出适当的修剪策略,利用高效的数据结构,并使用紧密的反单调上限可以克服这个问题,并导致高实用程序项目集挖掘方法的显着性能改进。本文介绍了一种新的基于投影的方法,称为MAHI(矩阵辅助高实用程序挖掘),它使用了一种名为MA-FRUNE的新型实用矩阵的修剪策略,以提高高效项目集挖掘性能执行时间。实验结果表明,Mahi比以前的算法快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号