首页> 外文会议>SIAM International Conference on Data Mining >A Novel Approximation to Dynamic Time Warping allows Anytime Clustering of Massive Time Series Datasets
【24h】

A Novel Approximation to Dynamic Time Warping allows Anytime Clustering of Massive Time Series Datasets

机译:动态时间翘曲的新颖近似允许随时群集大规模时间序列数据集

获取原文

摘要

Given the ubiquity of time series data, the data mining community has spent significant time investigating the best time series similarity measure to use for various tasks and domains. After more than a decade of extensive efforts, there is increasing evidence that Dynamic Time Warping (DTW) is very difficult to beat. Given that, recent efforts have focused on making the intrinsically slow DTW algorithm faster. For the similarity-search task, an important subroutine in many data mining algorithms, significant progress has been made by replacing the vast majority of expensive DTW calculations with cheap-to-compute lower bound calculations. However, these lower bound based optimizations do not directly apply to clustering, and thus for some realistic problems, clustering with DTW can take days or weeks. In this work, we show that we can mitigate this untenable lethargy by casting DTW clustering as an anytime algorithm. At the heart of our algorithm is a novel data-adaptive approximation to DTW which can be quickly computed, and which produces approximations to DTW that are much better than the best currently known linear-time approximations. We demonstrate our ideas on real world problems showing that we can get virtually all the accuracy of a batch DTW clustering algorithm in a fraction of the time.
机译:鉴于时间序列数据的无处不在,数据挖掘社区花了很大程度上调查用于各种任务和域的最佳时间序列相似度量。经过十多年的广泛努力,越来越多的证据表明动态时间翘曲(DTW)很难击败。鉴于这一点,最近的努力集中于使本质上慢的DTW算法更快。对于相似性 - 搜索任务,许多数据挖掘算法中的一个重要子程序,通过替换廉价到计算下限计算来取代绝大多数昂贵的DTW计算来实现显着进展。但是,这些基于较少的基于界限的优化不直接适用于群集,因此对于一些现实问题,与DTW的聚类可能需要数天或数周。在这项工作中,我们表明我们可以通过将DTW聚类作为随时算法铸造DTW聚类来减轻这种无法维持的嗜好。在我们的算法的核心,是可以快速计算的DTW的新数据自适应近似,并且它产生比最佳当前已知的线性时间近似更好的DTW的近似。我们展示了我们关于现实世界问题的想法,表明我们可以在一小部分时间内实现批量DTW聚类算法的几乎所有准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号