首页> 外文期刊>International Journal of Computer Systems Science & Engineering >Subsequence matching under time warping in time-series databases: observation, optimization, and performance results
【24h】

Subsequence matching under time warping in time-series databases: observation, optimization, and performance results

机译:时间序列数据库中时间扭曲下的子序列匹配:观察,优化和性能结果

获取原文
获取原文并翻译 | 示例
       

摘要

This paper discusses effective processing of subsequence matching under time warping in time-series databases. Time warping is a transformation that enables the finding of sequences with similar patterns even when they are of different lengths. Through a preliminary experiment, we first point out that the performance bottleneck of Naive-Scan, a basic method for processing of subsequence matching under time warping, occurs in the CPU processing step. Then, we propose a novel method that optimizes such CPU processing step of Naive-Scan. The proposed method maximizes the CPU performance by eliminating all the redundant calculations required in computing the time warping distance of the query sequence against data subsequences. We formally prove that the proposed method is not only an optimal one for processing Naive-Scan but also does not incur false dismissals. Also, we discuss how to apply the proposed method to the post-processing step of LB-Scan and ST-Filter, the two well-known methods that employ the filtering step. Then, we quantitatively verify the performance improvement effects obtained by the proposed method via extensive experiments. The results show that the performance of all the three previous methods improves by employing the proposed method. In particular, Naive-Scan, which has been known to show the worst performance, performs much better than LB-Scan as well as ST-Filter in all the cases by employing the proposed method for their CPU processing. This result is so interesting and valuable in that the performance inversion among Naive-Scan, LB-Scan, and ST-Filter has occurred by optimizing the CPU processing step, which is their common performance bottleneck.
机译:本文讨论了时间序列数据库中时间扭曲下子序列匹配的有效处理。时间扭曲是一种转换,即使长度不同,也可以查找具有相似模式的序列。通过初步实验,我们首先指出Naive-Scan的性能瓶颈是在CPU处理步骤中出现的时间瓶颈,它是时间扭曲下处理子序列匹配的基本方法。然后,我们提出了一种新颖的方法,可以优化Naive-Scan的此类CPU处理步骤。所提出的方法通过消除在计算查询序列相对于数据子序列的时间扭曲距离时所需的所有冗余计算,来最大化CPU性能。我们正式证明了该方法不仅是处理Naive-Scan的最佳方法,而且不会引起虚假解雇。此外,我们讨论了如何将建议的方法应用于LB-Scan和ST-Filter(采用过滤步骤的两种众所周知的方法)的后处理步骤。然后,我们通过广泛的实验,定量验证了该方法所获得的性能改进效果。结果表明,采用所提出的方法可以改善所有前三种方法的性能。特别是,已知的Naive-Scan性能最差,在所有情况下,通过采用建议的方法进行CPU处理,其性能都比LB-Scan和ST-Filter好得多。该结果非常有趣且有价值,因为通过优化CPU处理步骤已在Naive-Scan,LB-Scan和ST-Filter之间发生了性能反转,这是它们的常见性能瓶颈。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号