Duality-based subsequence matching in time-series databases

机译：时间序列数据库中基于对偶的子序列匹配

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The authors propose a subsequence matching method, Dual Match, which exploits duality in constructing windows and significantly improves performance. Dual Match divides data sequences into disjoint windows and the query sequence into sliding windows, and thus, is a dual approach of the one by C. Faloutsos et al. (1994), which divides data sequences into sliding windows and the query sequence into disjoint windows. We formally prove that our dual approach is correct, i.e., it incurs no false dismissal. We also prove that, given the minimum query length, there is a maximum bound of the window size to guarantee correctness of Dual Match and discuss the effect of the window size on performance. FRM causes a lot of false alarms by storing minimum bounding rectangles rather than individual points representing windows to avoid excessive storage space required for the index. Dual Match solves this problem by directly storing points, but without incurring excessive storage overhead. Experimental results show that, in most cases, Dual Match provides large improvement in both false alarms and performance over FRM, given the same amount of storage space. In particular, for low selectivities (less than 10/sup -4/), Dual Match significantly improves performance up to 430-fold. On the other hand, for high selectivities(more than 10/sup -2/), it shows a very minor degradation (less than 29%). For selectivities in between (10/sup -4//spl sim/10/sup -2/), Dual Match shows performance slightly better than that of FRM. Dual Match is also 4.10/spl sim/25.6 times faster than FRM in building indexes of approximately the same size. Overall, these results indicate that our approach provides a new paradigm in subsequence matching that improves performance significantly in large database applications.

机译：作者提出了一种后续匹配方法，双重匹配，它在构建Windows中利用二元性并显着提高性能。双重匹配将数据序列划分为不相交的窗口和查询序列转换为滑动窗口，因此是C. faloutsos等人的双方法。（1994），将数据序列划分为滑动窗口和查询序列分成不相交的窗口。我们正式证明我们的双重方法是正确的，即，它不会被错误解雇。我们还证明，鉴于最小查询长度，窗口大小有最大限度的界限，以保证双匹配的正确性，并讨论窗口大小对性能的影响。 FRM通过存储最小边界矩形而不是表示窗口的单个点来引起大量误报，以避免索引所需的过度存储空间。双重匹配通过直接存储点来解决此问题，但不会产生过多的存储开销。实验结果表明，在大多数情况下，双匹配在鉴于相同数量的存储空间，双重匹配在FRM上的误报和性能方面提供了大量的改进。特别是，对于低选择性（小于10 / sup -4 /），双匹配显着提高了430倍的性能。另一方面，对于高选择性（超过10 / sup -2 /），它显示出非常轻微的降解（小于29％）。对于在（10 / SUP-4 // SPL SIM / 10 / SUP-2 /）之间的选择性，双匹配显示性能略高于FRM。双匹配也比FRM在大致相同的尺寸大致相同的索引中的4.10 / SPL SIM / 25.6倍。总的来说，这些结果表明，我们的方法在随后匹配中提供了一种新的范例，这在大型数据库应用程序中提高了性能。

著录项

来源
《》|2001年|P.263-272|共10页
会议地点
作者
Yang-Sae Moon; Kyu-Young Whang; Woong-Kee Loh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Linear Detrending Subsequence Matching in Time-Series Databases [J] . Myeong-Seon GIL, Yang-Sae MOON, Bum-Soo KIM IEICE transactions on information and systems . 2011,第4期

机译：时间序列数据库中的线性去趋势子序列匹配
2. Linear Detrending Subsequence Matching in Time-Series Databases [J] . Myeong-Seon GIL, Yang-Sae MOON, Bum-Soo KIM IEICE Transactions on Information and Systems . 2011,第4期

机译：时间序列数据库中的线性去趋势子序列匹配
3. Embedding-Based Subsequence Matching in Time-Series Databases [J] . PANAGIOTIS PAPAPETROU, MICHALIS POTAMIAS, GEORGE KOLLIOS, ACM transactions on database systems . 2011,第3期

机译：时间序列数据库中基于嵌入的子序列匹配
4. Duality-based subsequence matching in time-series databases [C] . Yang-Sae Moon, Kyu-Young Whang, Woong-Kee Loh, International Conference on Data Engineering . 2001

机译：基于二元的子序列匹配时间序列数据库
5. Embedding-based subsequence matching in large sequence databases. [D] . Papapetrou, Panagiotis. 2010

机译：大序列数据库中基于嵌入的子序列匹配。
6. Mining Interesting Patterns in Time-series Medical Databases: A Hybrid Approach of Multiscale Matching and Rough Clustering [O] . Shoji Hirano, Shusaku Tsumoto 2002

机译：在时序医学数据库中挖掘有趣的模式：多尺度匹配和粗糙聚类的混合方法
7. Linear Detrending Subsequence Matching in Time-Series Databases [O] . Myeong-seon Gil, Yang-sae Moon, Bum-soo Kim 2016

机译：时间序列数据库中的线性去趋势子序列匹配

Duality-based subsequence matching in time-series databases

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅