Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams

Reem Al-Mulla; Zaher Al Aghbari

首页> 外文期刊>International Journal of Data Warehousing and Mining >Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams

【24h】

Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams

机译：用于发现多个数据流中频繁子序列的增量算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, new applications emerged that produce data streams, such as stock data and sensor networks. Therefore, finding frequent subsequences, or clusters of subsequences, in data streams is an essential task in data mining. Data streams are continuous in nature, unbounded in size and have a high arrival rate. Due to these characteristics, traditional clustering algorithms fail to effectively find clusters in data streams. Thus, an efficient incremental algorithm is proposed to find frequent subsequences in multiple data streams. The described approach for finding frequent subsequences is by clustering subsequences of a data stream. The proposed algorithm uses a window model to buffer the continuous data streams. Further, it does not recompute the clustering results for the whole data stream at every window, but rather it builds on clustering results of previous windows. The proposed approach also employs a decay value for each discovered cluster to determine when to remove old clusters and retain recent ones. In addition, the proposed algorithm is efficient as it scans the data streams once and it is considered an Any-time algorithm since the frequent subsequences are ready at the end of every window.

机译：近年来，出现了产生数据流的新应用程序，例如库存数据和传感器网络。因此，在数据流中查找频繁的子序列或子序列簇是数据挖掘中的基本任务。数据流本质上是连续的，大小不受限制，并且到达率很高。由于这些特性，传统的聚类算法无法有效地找到数据流中的聚类。因此，提出了一种有效的增量算法来查找多个数据流中的频繁子序列。用于发现频繁子序列的所述方法是通过对数据流的子序列进行聚类。所提出的算法使用窗口模型来缓冲连续数据流。此外，它不会在每个窗口重新计算整个数据流的聚类结果，而是在先前窗口的聚类结果基础上构建。所提出的方法还对每个发现的簇使用一个衰减值，以确定何时删除旧簇并保留最近的簇。另外，该算法是高效的，因为它一次扫描数据流，并且由于频繁的子序列已在每个窗口的末尾准备好，因此被认为是随时算法。

著录项

来源
《International Journal of Data Warehousing and Mining》 |2011年第4期|共20页
作者
Reem Al-Mulla; Zaher Al Aghbari;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类矿业工程;
关键词
Any-Time Algorithm; Clustering Subsequences; Data Streams; Frequent Subsequences; Incremental Algorithm;

机译：任意时间算法;聚类子序列;数据流;频繁子序列;增量算法;
入库时间 2022-08-18 10:40:40

相似文献

外文文献
中文文献
专利

1. Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams [J] . Reem Al-Mulla, Zaher Al Aghbari International Journal of Data Warehousing and Mining . 2011,第4期

机译：用于发现多个数据流中频繁子序列的增量算法
2. Efficient stream subsequence matching algorithms for handheld devices on streaming time-series data [J] . Yang-Sae Moon, Woong-Kee Loh International Journal of Computer Systems Science & Engineering . 2009,第4期

机译：手持设备在流时间序列数据上的高效流子序列匹配算法
3. A Study on Distributed Frequent Co-occurrence Patterns Algorithms across Multiple Data Streams [J] . Jing Guo Journal of software . 2016,第12期

机译：跨多个数据流的分布式频繁共现模式算法研究
4. A New Algorithm for Maintaining Closed Frequent Itemsets in Data Streams by Incremental Updates [C] . Hua-Fu Li, Chin-Chuan Ho, Fang-Fei Kuo, IEEE International Conference on Data Mining . 2006

机译：一种新的算法，通过增量更新维护数据流中的闭合频繁项集
5. Mining frequent sequential patterns in data streams using SSM-algorithm. [D] . Monwar, Mostafa. 2005

机译：使用SSM算法在数据流中挖掘频繁的顺序模式。
6. Designing a Streaming Algorithm for Outlier Detection in Data Mining—An Incremental Approach [O] . Kangqing Yu, Wei Shi, Nicola Santoro 2020

机译：设计用于数据挖掘中异常值检测的流算法—一种增量方法
7. A Study on Distributed Frequent Co-occurrence Patterns Algorithms across Multiple Data Streams [O] . Jing Guo 2016

机译：多数据流分布式频繁共生成模式算法的研究

Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams

摘要

著录项

相似文献

相关主题

期刊订阅