Prefix-Querying: An Approach for Effective Subsequence Matching Under Time Warping in Sequence Databases

机译：前缀查询：序列数据库中扭曲下的有效后续匹配的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper discusses an index-based subsequence matching that supports time warping in large sequence databases. Time warping enables finding sequences with similar patterns even when they are of different lengths. In our earlier work, we suggested an efficient method for whole matching under time warping. This method constructs a multidimensional index on a set of feature vectors, which are invariant to time warping, from data sequences. For filtering at feature space, it also applies a lower-bound function, which consistently underestimates the time warping distance as well as satisfies the triangular inequality. In this paper, we incorporate the prefix-querying approach based on sliding windows into the earlier approach. For indexing, we extract a feature vector from every subsequence inside a sliding window and construct a multi-dimensional index using a feature vector as indexing attributes. For query processing, we perform a series of index searches using the feature vectors of qualifying query prefixes. Our approach provides effective and scalable subsequence matching even with a large volume of a database. We also prove that our approach does not incur false dismissal. To verify the superiority of our method, we perform extensive experiments. The results reveal that our method achieves significant speedup with real-world S&P 500 stock data and with very large synthetic data.

机译：本文讨论了基于索引的子序列匹配，支持在大序列数据库中的时间翘曲。即使当它们具有不同的长度时，时间扭曲使得能够找到具有类似模式的序列。在我们之前的工作中，我们建议在扭曲的情况下为整个匹配的有效方法。该方法在一组特征向量上构建多维索引，其从数据序列中不变时不变。为了在特征空间进行过滤，它还应用一个较低的函数，这始终低估了时间翘曲距离以及满足三角不平等。在本文中，我们将基于滑动窗口的前缀查询方法纳入前面的方法。对于索引，我们从滑动窗口内的每个子序列提取一个特征向量，并使用要素向量构造多维索引作为索引属性。对于查询处理，我们使用限定查询前缀的特征向量执行一系列索引搜索。我们的方法也提供了甚至具有大量数据库的有效和可扩展的子序列匹配。我们还证明了我们的方法不会遭受虚假解雇。为了验证我们方法的优越性，我们执行广泛的实验。结果表明，我们的方法通过现实世界的标准普尔500辆股票数据和具有非常大的合成数据来实现显着加速。

著录项

来源
《International conference on information and knowledge management》|2001年||共8页
会议地点
作者
Sriram Padmanabhan; June-Suh Cho; Sang-Wook Kim; Sanghyun Park;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息管理系统;
关键词
similarity search; prefix querying; indexing technique; time warping; subsequence matching; sequence database;

机译：相似性搜索;前缀查询;索引技术;时间翘曲;后续匹配;序列数据库;

相似文献

外文文献
中文文献
专利

1. Prefix-querying with an L_1 distance metric for time-series subsequence matching under time warping [J] . Sanghyun Park, Sang-Wook Kim Journal of Information Science . 2006,第5期

机译：在时间扭曲下使用L_1距离度量进行前缀查询以进行时间序列子序列匹配
2. Subsequence matching under time warping in time-series databases: observation, optimization, and performance results [J] . Sang-Wook Kim, Miyoung Shin International Journal of Computer Systems Science & Engineering . 2008,第1期

机译：时间序列数据库中时间扭曲下的子序列匹配：观察，优化和性能结果
3. Linear Detrending Subsequence Matching in Time-Series Databases [J] . Myeong-Seon GIL, Yang-Sae MOON, Bum-Soo KIM IEICE transactions on information and systems . 2011,第4期

机译：时间序列数据库中的线性去趋势子序列匹配
4. Prefix-Querying: An Approach for Effective Subsequence Matching Under Time Warping in Sequence Databases [C] . Sriram Padmanabhan, June-Suh Cho, Sang-Wook Kim, International conference on information and knowledge management . 2001

机译：前缀查询：序列数据库中扭曲下的有效后续匹配的方法
5. Embedding-based subsequence matching in large sequence databases. [D] . Papapetrou, Panagiotis. 2010

机译：大序列数据库中基于嵌入的子序列匹配。
6. Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping [O] . THANAWIN RAKTHANMANON, BILSON CAMPANA, ABDULLAH MUEEN, -1

机译：解决大数据时间序列：动态时间规整下挖掘数千个时间序列子序列
7. L-Match: A Lightweight and Effective Subsequence Matching Approach [O] . Kefeng Feng, Peng Wang, Jiaye Wu, 2020

机译：L-LIVE：轻量级和有效的后续匹配方法

Prefix-Querying: An Approach for Effective Subsequence Matching Under Time Warping in Sequence Databases

摘要

著录项

相似文献

相关主题

期刊订阅