Mining High Utility Patterns in One Phase without Generating Candidates

J. Liu; K. Wang; B. C. M. Fung

Abstract

Utility mining is a new development of data mining technology. Among utility mining problems, utility mining with the itemset share framework is a hard one, as no anti-monotonicity property holds for the interestingness measure. Prior works on this problem all employ a two-phase, candidate-generation approach, with one exception that is, however, inefficient and does not scale to large databases. The two-phase approach suffers from scalability issues due to the huge number of candidates. This paper proposes a novel algorithm that finds high utility patterns in a single phase without generating candidates. The novelties lie in a high utility pattern growth approach, a lookahead strategy, and a linear data structure. Concretely, our pattern growth approach searches a reverse set enumeration tree and prunes the search space by utility upper bounding. We also look ahead to identify high utility patterns without enumeration, using a closure property and a singleton property. Our linear data structure enables us to compute a tight bound for powerful pruning and to directly identify high utility patterns in an efficient and scalable way, which targets the root cause of the problems with prior algorithms. Extensive experiments on sparse and dense, synthetic and real-world data suggest that our algorithm is up to 1 to 3 orders of magnitude more efficient and is more scalable than the state-of-the-art algorithms.
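
To make the terms in the abstract concrete, the following is a minimal Python sketch of the problem setting, not the paper's algorithm. It assumes a toy database in which each transaction maps an item to a purchased quantity and each item has a unit profit; the data, the names `DATABASE`, `UNIT_PROFIT`, `utility`, `upper_bound`, and `mine`, and the threshold are all hypothetical. It shows that itemset utility is not anti-monotone, and that a depth-first pattern growth search can still prune with an upper bound. The bound used here is the loose, widely used one (the summed utility of all transactions containing the pattern, often called transaction-weighted utility), which stands in for the tighter bound, the reverse set enumeration tree, and the linear data structure that the abstract mentions but does not specify.

```python
# Hypothetical toy data: each transaction maps item -> purchased quantity.
DATABASE = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 1, "d": 4},
    {"a": 1, "b": 1, "d": 1},
]
# Hypothetical unit profit of each item.
UNIT_PROFIT = {"a": 5, "b": 2, "c": 1, "d": 3}


def item_utility(item, transaction):
    """Utility of one item in one transaction: quantity times unit profit."""
    return transaction[item] * UNIT_PROFIT[item]


def utility(itemset, database=DATABASE):
    """Sum the utility of the itemset's items over transactions containing all of them."""
    total = 0
    for t in database:
        if all(i in t for i in itemset):
            total += sum(item_utility(i, t) for i in itemset)
    return total


def upper_bound(itemset, database=DATABASE):
    """Summed utility of whole transactions that contain the itemset.

    Any superset of the itemset occurs only in these transactions and can
    never be worth more than the full transaction, so this value bounds the
    utility of the itemset and of every superset. It is a loose stand-in
    for the tighter bound described in the abstract.
    """
    return sum(
        sum(item_utility(i, t) for i in t)
        for t in database
        if all(i in t for i in itemset)
    )


def mine(min_utility, database=DATABASE):
    """Depth-first pattern growth that reports patterns directly (single pass
    over the search space, no separate candidate verification phase)."""
    items = sorted({i for t in database for i in t})
    results = {}

    def grow(prefix, start):
        for k in range(start, len(items)):
            pattern = prefix + (items[k],)
            if upper_bound(pattern, database) < min_utility:
                continue  # no extension of this pattern can reach the threshold
            u = utility(pattern, database)
            if u >= min_utility:
                results[pattern] = u
            grow(pattern, k + 1)

    grow((), 0)
    return results


if __name__ == "__main__":
    # Utility is neither monotone nor anti-monotone:
    print(utility(("c",)), utility(("a", "c")), utility(("a",)))  # 5 19 20
    print(mine(15))
```

In this toy data the itemset {c} has utility 5 while its superset {a, c} has utility 19, so a low-utility itemset cannot simply be discarded along with its supersets, which is why an upper bound is needed for pruning.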

Bibliographic details

Source: IEEE Transactions on Knowledge and Data Engineering, 2016, Issue 5, pp. 1245-1257 (13 pages)
Authors: J. Liu; K. Wang; B. C. M. Fung
Author affiliation: School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou, China
Original format: PDF
Language: English
Keywords: data mining; frequent patterns; high utility patterns; pattern mining; utility mining

Similar literature

1. Survey Mining High Utility Patterns In One Phase Without Generating Candidates [J]. P. Sengottuvelan, S. Joseph Gabriel. International Journal of Computer Trends and Technology, 2016, Issue 2.
2. Mining High Utility Pattern in One Phase without Candidate Generation using up Growth+ Algorithm [J]. P.Sri Varshini, N.Saranya.N, Uma Maheswari. International Journal of Engineering Trends and Technology, 2017, Issue 4.
3. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach [J]. Jiawei Han, Jian Pei, Yiwen Yin. Data Mining and Knowledge Discovery, 2004, Issue 1.
4. Mining Maximal Sequential Patterns without Candidate Maintenance [C]. Philippe Fournier-Viger, Cheng-Wei Wu, Vincent S. Tseng. International Conference on Advanced Data Mining and Applications, 2013.
5. Frequent pattern mining without candidate generation or support constraint [D]. William Cheung. 2003.
6. Mining significant high utility gene regulation sequential patterns [O]. Morteza Zihayat, Heidar Davoudi, Aijun An. 2017.
7. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach [O]. Jiawei Han, Jian Pei, Yiwen Yin. 2004.
