An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

Tianjun Lu; Yang Liu; Le Wang

首页> 外文期刊>Journal of software >An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

【24h】

An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

机译：数据流中Top-i> k 高效项集挖掘算法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Existing top-k high utility itemset (HUI) mining algorithms generate candidate itemsets in the mining process; their timeandspace performance might be severely affected when the dataset is large or contains many long transactions; and when applied to data streams, the performance of corresponding mining algorithm is especially crucial. To address this issue, propose a sliding window based top-k HUIs mining algorithm TOPK-SW; it first stores each batch data of current window as well as the items’ utility information to a tree called HUI-Tree, which ensures effective retrieval of utility values without re-scan the dataset, so as to efficiently improve the mining performance. TOPK-SW was tested on 4 classical datasets; results show that TOPK-SW outperforms existing algorithms significantly in both time and space efficiency, especially the time performance improves over 1 order of magnitude.

机译：现有的top-k高效项目集（HUI）挖掘算法会在挖掘过程中生成候选项目集;当数据集很大或包含许多长事务时，它们的时间和空间性能可能会受到严重影响;当应用于数据流时，相应挖掘算法的性能尤为关键。为了解决这个问题，提出了一种基于滑动窗口的前k个HUI挖掘算法TOPK-SW。它首先将当前窗口的每个批次数据以及项目的实用程序信息存储到名为HUI-Tree的树中，该树确保有效地检索实用程序值而无需重新扫描数据集，从而有效地提高了挖掘性能。 TOPK-SW在4个经典数据集上进行了测试;结果表明，TOPK-SW在时间和空间效率上均明显优于现有算法，尤其是时间性能提高了1个数量级。

著录项

来源
《Journal of software》 |2014年第9期|共6页
作者
Tianjun Lu; Yang Liu; Le Wang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
data streamhigh utility itemsetfrequent itemsetdata miningtop-k;

机译：数据流高实用项集频繁项集数据挖掘top-k;

相似文献

外文文献
中文文献
专利

1. An Algorithm of Top-k High Utility Itemsets Mining over Data Stream [J] . Tianjun Lu, Yang Liu, Le Wang Journal of software . 2014,第9期

机译：基于数据流的Top-k高效项目集挖掘算法
2. An Algorithm of Top-k High Utility Itemsets Mining over Data Stream [J] . Tianjun Lu, Yang Liu, Le Wang Journal of Computers . 2014,第9期

机译：数据流中Top-i> k 高效项集挖掘算法
3. An Algorithm of Top-k High Utility Itemsets Mining over Data Stream [J] . Tianjun Lu, Yang Liu, Le Wang Journal of Computers . 2014,第9期

机译：数据流中Top-i> k 高效项集挖掘算法
4. Implementing a Hybrid of Efficient Algorithms For Mining Top-K High Utility Itemsets [C] . Ingle Mayur Rajendra, Sanika Sameer Moghe, Sachin Sakhare, International Conference on Computing Communication Control and Automation . 2018

机译：实施高效算法的混合，以挖掘Top-K高实用程序集
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. A Fast and Efficient Algorithm for Mining Top-k Nodes in Complex Networks [O] . Dong Liu, Yun Jing, Jing Zhao, -1

机译：复杂网络中Top-k节点的快速高效挖掘算法
7. High Utility Itemset Mining with Top-k CHUD (TCHUD) Algorithm [O] . Anu Augustin, Vince Paul, Vishnu G. 2017

机译：高实用程序项目集与Top-K Chud（TCHUD）算法挖掘

An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

摘要

著录项

相似文献

相关主题

期刊订阅