首页> 外文期刊>Engineering Applications of Artificial Intelligence >High average-utility itemset mining with multiple minimum utility threshold: A generalized approach
【24h】

High average-utility itemset mining with multiple minimum utility threshold: A generalized approach

机译:高平均实用程序项集采用多个最低实用程序阈值:广义方法

获取原文
获取原文并翻译 | 示例
           

摘要

High Average-Utility Itemset (HAUI) mining is an emerging pattern mining technique to extract meaningful patterns from a transaction dataset. In the past, several HAUI mining algorithms have been developed with efficient upper-bounds and pruning strategies. However, all these algorithms use a single value of the minimum average-utility threshold for all the itemsets, which limits their applicability to real-life datasets. In order to address this issue, several HAUI mining algorithms with multiple average-utility thresholds have been developed that process the items in ascending order of their minimum average-utility threshold. However, it makes them inapplicable on traditional HAUI mining algorithms. Moreover, the perturbation in preference of items may reduce the performance of the algorithms. This paper presents an HAUI mining algorithm named Generalized High Average-utility Itemset Miner (GHAIM) that processes the items in ascending order of their Average Utility Upper-Bound (AUUB) like the traditional HAUI mining algorithms. A new approach named suffix minimum average-utility is proposed to retain the downward closure property of AUUB and several pruning methods. Besides, a compact list structure is also proposed to mine the HAUIs in one phase. Several pruning methods have been introduced for reducing search space and improving efficiency. Extensive experiments were performed with different sparse and dense types of datasets to determine GHAIM efficiency compared to two existing algorithms. It was observed from the results that GHAIM outperforms both the current algorithms in run time, memory consumption, number of candidate itemsets, and scalability.
机译:高平均实用程序项目集(HAUI)挖掘是一种新兴模式挖掘技术,用于从事务数据集中提取有意义的模式。过去,已经使用有效的上限和修剪策略开发了几种Haui挖掘算法。然而,所有这些算法都使用所有项目集的最小平均实用程序阈值的单个值限制了它们对现实生活数据集的适用性。为了解决这个问题,已经开发了几种具有多个平均实用程序阈值的HAUI挖掘算法,该算法以最小平均实用程序阈值的升序处理项目。但是,它使它们无法在传统的Haui采矿算法上不适用。此外,在物品的偏好中的扰动可以降低算法的性能。本文介绍了一个名为Generalized高平均实用程序项集(GHAim)的Haui挖掘算法,该矿物(GHAim)按照传统的Haui挖掘算法等平均实用程序上限(auub)的升序处理项目。提出了一种名为后缀最小平均实用程序的新方法,以保留Auub的向下闭合性和多种修剪方法。此外,还提出了一个紧凑的列表结构来在一个阶段挖掘Hauis。已经引入了几种修剪方法来减少搜索空间并提高效率。通过不同的稀疏和密集类型的数据集进行了广泛的实验,以与两个现有算法相比确定Ghaim效率。从Ghaim优于运行时,内存消耗,候选项目数量和可扩展性的结果,Ghaim优于当前算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号