Efficiently mining of skyline frequent-utility patterns

Pan Jeng-Shyang; Lin Jerry Chun-Wei; Yang Lu; Fournier-Viger Philippe; Hong Tzung-Pei

Abstract

Frequent itemset mining (FIM) is one of the most common data mining techniques; it is based on analyzing the occurrence frequencies of items in transactions. However, it is inapplicable in real-life situations, since customers may purchase several units of the same item and items may not all have the same unit profit. High-utility itemset mining (HUIM) was designed to consider both the quantities and the unit profits of items in databases, and has become an emerging and critical research topic in recent decades. The SKYMINE approach was proposed to mine skyline frequent-utility patterns (SFUPs) by considering both the utility and the occurrence frequencies of items. An SFUP is a non-dominated itemset, where the dominance relationship between itemsets is based on the utility and frequency measures. However, mining SFUPs with the SKYMINE algorithm and its utility-pattern (UP)-tree structure requires long execution times. In this paper, we propose a more efficient algorithm named skyline frequency-utility (SFU)-Miner to mine SFUPs using the utility-list structure. Unlike the SKYMINE algorithm and its UP-tree structure, the utility-list structure makes it possible to calculate the actual utilities of itemsets efficiently without generating candidates. In addition, an array called utility-max (umax) is developed to keep the maximal utility for each occurrence frequency, which can greatly reduce the number of itemsets considered when directly mining the SFUPs. This property can be used to efficiently find the non-dominated itemsets based on the utility and frequency measures. Substantial experiments have been carried out to evaluate the performance of the proposed algorithm. The results show that SFU-Miner outperforms the state-of-the-art SKYMINE algorithm for SFUP mining in terms of runtime, memory consumption, number of candidates, and scalability.
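The dominance relationship and the role of a umax-style array can be illustrated with a small brute-force sketch. This is not the SFU-Miner algorithm: it enumerates every itemset and uses neither utility lists nor pruning. The toy database, the unit profits, and all function names are hypothetical. It also assumes that umax[f] holds the largest utility among itemsets whose frequency is at least f, and that all unit profits are positive; the abstract does not spell out either detail.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 1},
    {"b": 3},
]
# Hypothetical unit profits (assumed positive).
profit = {"a": 5, "b": 2, "c": 10}


def measures(itemset, transactions, profit):
    """Return (frequency, utility), counting only transactions that contain every item."""
    freq, util = 0, 0
    for t in transactions:
        if all(item in t for item in itemset):
            freq += 1
            util += sum(t[item] * profit[item] for item in itemset)
    return freq, util


def dominates(x, y):
    """x and y are (frequency, utility) pairs; x dominates y if it is
    at least as good on both measures and the pairs differ."""
    return x != y and x[0] >= y[0] and x[1] >= y[1]


def skyline_patterns(transactions, profit):
    """Brute-force skyline of (frequency, utility) over all itemsets."""
    items = sorted(profit)
    found = {}
    for k in range(1, len(items) + 1):
        for itemset in combinations(items, k):
            f, u = measures(itemset, transactions, profit)
            if f > 0:
                found[itemset] = (f, u)

    n = len(transactions)
    # Assumption: umax[f] = maximal utility among itemsets with frequency >= f.
    umax = [0] * (n + 2)
    for f, u in found.values():
        umax[f] = max(umax[f], u)
    for f in range(n, 0, -1):  # suffix maximum turns "== f" into ">= f"
        umax[f] = max(umax[f], umax[f + 1])

    # An itemset is non-dominated if nothing at least as frequent has a higher
    # utility, and nothing strictly more frequent matches its utility.
    return {
        x: (f, u)
        for x, (f, u) in found.items()
        if u == umax[f] and u > umax[f + 1]
    }


print(skyline_patterns(transactions, profit))
```

On this toy data the skyline contains two itemsets: {a} with frequency 3 and utility 20, and {a, c} with frequency 2 and utility 35. Every other itemset is dominated by one of them; for example {a, b, c}, with frequency 1 and utility 17, is dominated by {a, c}.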


Bibliographic record

Source
Intelligent Data Analysis, 2017, Issue 6, pp. 1407-1423 (17 pages)
Authors
Pan Jeng-Shyang; Lin Jerry Chun-Wei; Yang Lu; Fournier-Viger Philippe; Hong Tzung-Pei;
Author affiliations

Fuzhou Univ Int Studies & Trade, Fuzhou, Fujian, Peoples R China;

Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen Grad Sch, Shenzhen, Guangdong, Peoples R China;

Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen Grad Sch, Shenzhen, Guangdong, Peoples R China;

Harbin Inst Technol, Shenzhen Grad Sch, Sch Nat Sci & Humanities, Shenzhen, Guangdong, Peoples R China;

Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung, Taiwan;

Indexed in: Science Citation Index (SCI)
Original format: PDF
Language: English
Keywords
Data mining; umax array; skyline frequent-utility patterns; frequency; utility;


