BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams

Marwan Hassani; Daniel Toews; Alfredo Cuzzocrea; Thomas Seidl

首页> 外文期刊>International Journal of Data Science and Analytics >BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams

【24h】

BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams

机译：BFSPMINER：用于在数据流上挖掘连续模式的有效和有效的无批算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Supporting sequential pattern mining from data streams is nowadays a relevant problem in the area of data stream mining research. Actual proposals available in the literature are based on the well-known PrefixSpan approach and are, indeed, able to effectively bound the error of discovered patterns. This approach foresees the idea of dividing the target stream in a collection of manageable chunks, i.e., pieces of stream, in order to gain into effectiveness and efficiency. Unfortunately, mining patterns from stream chunks indeed introduce additional errors with respect to the basic application scenario where the target stream is mined continuously, in a non-batch manner. This is due to several reasons. First, since batches are processed individually, patterns that contain items from two consecutive batches are lost. Secondly, in most batch-based approaches, the decision about the frequency of a pattern is done locally inside a single batch. Thus, if a pattern is frequent in the stream but its items are scattered over different batches, it will be continuously pruned out and will never become frequent due to the algorithm's lack of the "complete-picture" perspective. In order to address so-delineated pattern mining problems, this paper introduces and experimentally assesses BFSPMiner, a Batch-Free Sequential Pattern Miner algorithm for effectively and efficiently mining patterns in streams without being constrained to the traditional batch-based processing. This allows us, for instance, to discover frequent patterns that would be lost according to alternative batch-based stream mining processing models. We complement our analytical contributions by means of a comprehensive experimental campaign of BFSPMiner against real-world data stream sets and in comparison with current batch-based stream sequential pattern mining algorithms.

机译：支持从数据流的顺序模式挖掘现在是数据流挖掘研究领域的相关问题。文献中可用的实际提案基于众所周知的前缀方法，并且实际上是能够有效地绑定发现模式的错误。这种方法预测将目标流划分在可管理的块的集合中，即流，流动，以获得有效性和效率。遗憾的是，来自流块的挖掘模式确实引入了关于连续开采目标流的基本应用场景的额外误差，以非批量方式。这是由于几个原因。首先，由于单独处理批处理，因此包含来自两个连续批次的项目的模式丢失。其次，在基于批量的方法中，关于图案频率的决定在单个批次内本地完成。因此，如果在流中频繁频繁，但其项目散布在不同的批次上，则由于算法缺乏“完整图像”的角度，它将连续地分散出来并且永远不会频繁。为了解决如此划定的模式挖掘问题，本文介绍并通过实验评估BFSPMINER，用于有效且有效地和有效地挖掘流中的模式，而不会被限制为传统的基于批处理的处理。例如，这允许我们发现根据基于批次的流挖掘处理模型将丢失的频繁模式。我们通过针对现实世界数据流集合的全面实验活动补充我们的分析贡献，并与当前基于批量流顺序模式挖掘算法相比。

著录项

来源
《International Journal of Data Science and Analytics》 |2019年第3期|223-239|共17页
作者
Marwan Hassani; Daniel Toews; Alfredo Cuzzocrea; Thomas Seidl;
展开▼
作者单位

Architecture of Information Systems Group Eindhoven University of Technology Eindhoven The Netherlands;

Fraunhofer Institute for Communication Information Processing and Ergonomics FKIE Wachtberg Germany;

DIA Department University of Trieste and ICAR-CNR Trieste Italy;

Database Systems Group LMU Munich Munich Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Sequential pattern mining; Data streams; Batch-free;

机译：顺序模式挖掘;数据流;无批处理;

相似文献

外文文献
中文文献
专利

1. Batch-Free Event Sequence Pattern Mining for Communication Stream Data with Instant and Persistent Events [J] . Lee Keon Myung, Han Chan Sik, Jun Joong Nam, Wireless personal communications: An Internaional Journal . 2019,第2期

机译：无批次的事件序列模式挖掘用于即时和持久事件的通信流数据
2. Efficiently mining high utility sequential patterns in static and streaming data [J] . Zihayat Morteza, Wu Cheng-Wei, An Aijun, Intelligent data analysis . 2017,第SUPPLa期

机译：在静态和流数据中有效挖掘高效的顺序模式
3. PTree: Mining Sequential Patterns Efficiently in Multiple Data Streams Environment [J] . Guanling Lee, Yi-Chun Chen, Kuo-Che Hung Journal of information science and engineering . 2013,第6期

机译：PTree：在多个数据流环境中有效地挖掘顺序模式
4. Effective Database Transformation and Efficient Support Computation for Mining Sequential Patterns [C] . Chung-Wen Cho, Yi-Hung Wu, Arbee L.P. Chen International Conference on Database Systems for Advanced Applications . 2005

机译：用于采矿顺序模式的有效数据库转换和高效支持计算
5. Mining frequent sequential patterns in data streams using SSM-algorithm. [D] . Monwar, Mostafa. 2005

机译：使用SSM算法在数据流中挖掘频繁的顺序模式。
6. An Efficient Incremental Mining Algorithm for Discovering Sequential Pattern in Wireless Sensor Network Environments [O] . Xin Lyu, Hongxu Ma 2019

机译：在无线传感器网络环境中发现顺序模式的高效增量挖掘算法
7. BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams [O] . Marwan Hassani, Daniel Töws, Alfredo Cuzzocrea, 2017

机译：BFSPMINER：用于在数据流上挖掘连续模式的有效和有效的无批算法

BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams

摘要

著录项

相似文献

相关主题

期刊订阅