首页> 外文学位 >Improving efficiency and effectiveness of dynamic time warping in large time series databases.
【24h】

Improving efficiency and effectiveness of dynamic time warping in large time series databases.

机译:提高大型时间序列数据库中动态时间规整的效率和有效性。

获取原文
获取原文并翻译 | 示例

摘要

In recent years, indexing, classification, clustering, and anomaly detection of time series data have become topics of great interest within the database/data mining community; within those data mining applications, the measurement of similarity between two time series is one of the most essential subroutines. The superiority of Dynamic Time Warping (DTW) similarity measure over Euclidean distance for these tasks has been well-demonstrated by many researchers. However, the greater accuracy of DTW comes at a price of forbiddingly higher CPU demands. The recent introduction of lower bounding function greatly helps address this problem, making DTW calculation tenable for larger datasets.; In spite of the great success of DTW in a variety of domains, there still are several persistent myths about it, causing great confusion in the literature. This work identifies and dispels these myths by empirical demonstration of the findings with a comprehensive set of experiments. Based on these findings, a novel framework, so called Ratanamahatana-Keogh Band ( R-K Band), is proposed, making DTW faster and more accurate by learning arbitrary constraints on the warping path of the DTW calculation. This proposed framework can conveniently replace the existing DTW distance measures, or can be incorporated into any other time series applications; the extensive empirical evaluations demonstrate significant improvement both in accuracy for classification problems, and in precision/recall for a relevance feedback system for time series retrieval. As an augmentation to this work, a bit-level dimensionality reduction technique, a clipped representation, is proposed and shown to improve the compression ratio by a wide margin, while being able to maintain or increase the tightness of its lower bound, allowing even faster nearest neighbor queries, especially in ones that require DTW distance measure.
机译:近年来,对时间序列数据进行索引,分类,聚类和异常检测已成为数据库/数据挖掘社区中的重要话题。在这些数据挖掘应用程序中,两个时间序列之间的相似性度量是最重要的子例程之一。对于这些任务,动态时间规整(DTW)相似性度量优于欧几里得距离的优越性已被许多研究人员充分证明。但是,更高的DTW精度会带来更高的CPU需求。最近引入的下界函数极大地帮助解决了这个问题,使DTW计算可用于较大的数据集。尽管DTW在各个领域都取得了巨大的成功,但关于DTW仍然存在着一些顽固的神话,这在文献中引起了极大的困惑。这项工作通过对实验结果进行全面的实验证明和发现,消除了这些神话。基于这些发现,提出了一种新颖的框架,即所谓的Ratanamahatana-Keogh波段(R-K波段),它通过学习DTW计算翘曲路径上的任意约束来使DTW更快,更准确。该提议的框架可以方便地替换现有的DTW距离度量,或者可以合并到任何其他时间序列应用程序中;广泛的经验评估表明,分类问题的准确性以及时间序列检索的相关性反馈系统的准确性/召回率都得到了显着提高。作为这项工作的补充,提出了一种位级降维技术(裁剪的表示),该技术可以大幅提高压缩率,同时可以保持或提高其下限的紧密度,从而可以更快地进行压缩。最近邻居查询,尤其是在需要DTW距离度量的查询中。

著录项

  • 作者

    Ratanamahatana, Chotirat.;

  • 作者单位

    University of California, Riverside.;

  • 授予单位 University of California, Riverside.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 164 p.
  • 总页数 164
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号