...
首页> 外文期刊>Intelligent data analysis >Efficiently mining of skyline frequent-utility patterns
【24h】

Efficiently mining of skyline frequent-utility patterns

机译:高效挖掘天际线频繁使用模式

获取原文
获取原文并翻译 | 示例

摘要

Frequent itemset mining (FIM) is one of the most common data mining techniques, which is based on the analysis of the occurrence frequencies of items in transactions. However, it is inapplicable in real-life situations since customers may purchase several units of the same item and all items may not have the same unit profits. High-utility itemset mining (HUIM) was designed to consider both the quantities and unit profits of items in databases, and has become an emerging and critical research topic in recent decades. The SKYMINE approach was proposed to mine the skyline frequent-utility patterns (SFUPs), by considering both the utility and the occurrence frequencies of items. A SFUP is a non-dominated itemset, where the dominance relationship between itemsets is based on the utility and frequency measures. Mining SFUPs using the SKYMINE algorithm and its (UP)-tree structure requires, however, long execution times. In this paper, we propose a more efficient algorithm named skyline frequency-utility (SFU)-Miner to mine the SFUPs, utilizing the utility-list structure. This latter structure is used to efficiently calculate the actual utilities of itemsets without generating candidates, contrarily to the SKYMINE algorithm and its UP-tree structure. Besides, an array called utility-max (umax) is further developed to keep information about the maximal utility for each occurrence frequency, which can be used to greatly reduce the amount of itemsets considered for directly mining the SFUPs. This property can be used to efficiently find the non-dominated itemsets based on the utility and frequency measures. Substantial experiments have been carried out to evaluate the proposed algorithm's performance. Results have shown that SFU-Miner outperforms the state-of-the-art SKYMINE algorithm for SFUP mining in terms of runtime, memory consumption, number of candidates, and scalability.
机译:频繁项集挖掘(FIM)是最常见的数据挖掘技术之一,它基于对交易中项的出现频率的分析。但是,它不适用于现实情况,因为客户可能会购买同一商品的多个单位,而所有商品可能没有相同的单位利润。高功能项集挖掘(HUIM)旨在同时考虑数据库中项的数量和单位利润,并且已成为近几十年来一个新兴且至关重要的研究主题。通过考虑项目的效用和出现频率,提出了SKYMINE方法来挖掘天际线频繁使用模式(SFUP)。 SFUP是非主导项目集,其中项目集之间的优势关系基于效用和频率度量。但是,使用SKYMINE算法及其(UP)树结构挖掘SFUP要求较长的执行时间。在本文中,我们提出了一种更有效的算法,称为天际线频率实用程序(SFU)-Miner,以利用实用程序列表结构来挖掘SFUP。与SKYMINE算法及其UP树结构相反,后一种结构用于有效地计算项集的实际效用,而无需生成候选项。此外,进一步开发了一个称为Utility-max(umax)的数组,以保留有关每个出现频率的最大效用的信息,该信息可用于大大减少直接挖掘SFUP所考虑的项集的数量。此属性可用于根据效用和频率度量有效地找到非主导项集。已经进行了大量实验以评估所提出算法的性能。结果表明,在运行时间,内存消耗,候选数量和可伸缩性方面,SFU-Miner优于用于SFUP挖掘的最新SKYMINE算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号