Similarity search for numerous patterns over multiple time series streams under dynamic time warping which supports data normalization

Bui Cong Giao; Duong Tuan Anh


Abstract

A major challenge in data mining today is similarity search in streaming time series under Dynamic Time Warping (DTW). In such a search, data normalization is required to obtain accurate results. However, normalizing data on the fly and computing DTW cost a great deal of computation time and memory. In this paper, we present two methods, SUCR-DTW and ESUCR-DTW, which conduct similarity search for numerous prespecified patterns over multiple time-series streams under DTW with support for data normalization. Both methods combine several techniques to mitigate these costs. They inherit the cascading lower bounds introduced in UCR-DTW, a state-of-the-art method for similarity search in static time series, to admissibly prune unpromising subsequences. To adapt to the streaming setting, SUCR-DTW incrementally updates the envelopes of newly arriving time-series subsequences and incrementally normalizes the time-series data. However, like UCR-DTW, SUCR-DTW retrieves only similar subsequences that have the same length as the patterns. ESUCR-DTW, an extension of SUCR-DTW, can find similar subsequences whose lengths differ from those of the patterns. Furthermore, the proposed methods exploit multi-threading to respond quickly to high-speed time-series streams. The experimental results show that SUCR-DTW obtains the same precision as UCR-DTW with lower wall-clock time. In addition, a comparison of SUCR-DTW and ESUCR-DTW reveals that the extended method achieves higher accuracy at the cost of longer wall-clock time. The paper also evaluates the influence of incremental z-score normalization and incremental min–max normalization on the obtained results.
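The pipeline described in the abstract (incremental normalization, an admissible lower bound, then the full DTW computation) can be illustrated with a minimal Python sketch. This is not the authors' SUCR-DTW implementation: it handles one pattern on one stream in a single thread, uses only the LB_Keogh bound rather than the full cascade, builds the envelope around the pattern only, and works with squared distances. All names (`SlidingZNorm`, `envelope`, `lb_keogh`, `dtw`, `search`) and the parameters `r` (warping band) and `threshold` are assumptions made for illustration.

```python
import math
from collections import deque


class SlidingZNorm:
    """Running sum and sum of squares over the last m stream values (hypothetical helper)."""

    def __init__(self, m):
        self.m = m
        self.window = deque()
        self.total = 0.0
        self.total_sq = 0.0

    def push(self, x):
        # O(1) update: add the new value, drop the oldest once the window is full.
        self.window.append(x)
        self.total += x
        self.total_sq += x * x
        if len(self.window) > self.m:
            old = self.window.popleft()
            self.total -= old
            self.total_sq -= old * old

    def full(self):
        return len(self.window) == self.m

    def normalized(self):
        # Mean and standard deviation come from the running sums; materializing
        # the normalized window is still O(m) in this simple sketch.
        mean = self.total / self.m
        var = max(self.total_sq / self.m - mean * mean, 0.0)
        std = math.sqrt(var)
        if std < 1e-12:
            return [0.0] * self.m
        return [(v - mean) / std for v in self.window]


def znorm(seq):
    """Batch z-score normalization, used once for the pattern."""
    mean = sum(seq) / len(seq)
    std = math.sqrt(sum((v - mean) ** 2 for v in seq) / len(seq))
    if std < 1e-12:
        return [0.0] * len(seq)
    return [(v - mean) / std for v in seq]


def envelope(q, r):
    """Lower and upper envelope of q within a warping band of half-width r."""
    n = len(q)
    lower = [min(q[max(0, i - r): i + r + 1]) for i in range(n)]
    upper = [max(q[max(0, i - r): i + r + 1]) for i in range(n)]
    return lower, upper


def lb_keogh(c, lower, upper):
    """Squared LB_Keogh: never exceeds the squared band-constrained DTW distance."""
    total = 0.0
    for v, lo, hi in zip(c, lower, upper):
        if v > hi:
            total += (v - hi) ** 2
        elif v < lo:
            total += (lo - v) ** 2
    return total


def dtw(a, b, r):
    """Squared DTW distance between equal-length sequences, Sakoe-Chiba band r."""
    n = len(a)
    inf = float("inf")
    prev = [inf] * (n + 1)
    prev[0] = 0.0
    for i in range(1, n + 1):
        cur = [inf] * (n + 1)
        for j in range(max(1, i - r), min(n, i + r) + 1):
            cost = (a[i - 1] - b[j - 1]) ** 2
            cur[j] = cost + min(prev[j], prev[j - 1], cur[j - 1])
        prev = cur
    return prev[n]


def search(stream, pattern, r, threshold):
    """Return (start_index, squared_distance) for windows within threshold of the pattern."""
    q = znorm(pattern)
    lower, upper = envelope(q, r)
    norm = SlidingZNorm(len(q))
    matches = []
    for t, x in enumerate(stream):
        norm.push(x)
        if not norm.full():
            continue
        c = norm.normalized()
        if lb_keogh(c, lower, upper) > threshold:
            continue  # admissible prune: the DTW distance cannot be within threshold
        d = dtw(q, c, r)
        if d <= threshold:
            matches.append((t - len(q) + 1, d))
    return matches
```

Calling `search(stream, pattern, r=1, threshold=0.01)` on a stream that contains a scaled and shifted copy of `pattern` reports the start index of that copy, because z-score normalization removes the offset and amplitude before the distance is computed. Keeping the bound and the distance in the same squared form is what makes the pruning step safe: a window is discarded only when the lower bound already exceeds the threshold.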


Bibliographic record

Source: Vietnam Journal of Computer Science, 2016, Issue 3, 16 pages
Authors: Bui Cong Giao; Duong Tuan Anh
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

