首页> 外文期刊>Knowledge and Information Systems >Clustering of time-series subsequences is meaningless: implications for previous and future research
【24h】

Clustering of time-series subsequences is meaningless: implications for previous and future research

机译:时间序列子序列的聚类是没有意义的:对先前和未来研究的启示

获取原文
获取原文并翻译 | 示例

摘要

Given the recent explosion of interest in streaming data and online algorithms, clustering of time-series subsequences, extracted via a sliding window, has received much attention. In this work, we make a surprising claim. Clustering of time-series subsequences is meaningless. More concretely, clusters extracted from these time series are forced to obey a certain constraint that is pathologically unlikely to be satisfied by any dataset, and because of this, the clusters extracted by any clustering algorithm are essentially random. While this constraint can be intuitively demonstrated with a simple illustration and is simple to prove, it has never appeared in the literature. We can justify calling our claim surprising because it invalidates the contribution of dozens of previously published papers. We will justify our claim with a theorem, illustrative examples, and a comprehensive set of experiments on reimplementations of previous work. Although the primary contribution of our work is to draw attention to the fact that an apparent solution to an important problem is incorrect and should no longer be used, we also introduce a novel method that, based on the concept of time-series motifs, is able to meaningfully cluster subsequences on some time-series datasets.
机译:鉴于最近对流数据和在线算法的关注激增,通过滑动窗口提取的时间序列子序列的聚类已引起了广泛关注。在这项工作中,我们提出了令人惊讶的主张。时间序列子序列的聚类是没有意义的。更具体地,从这些时间序列提取的聚类被迫服从某种病理上不可能由任何数据集满足的特定约束,因此,由任何聚类算法提取的聚类实质上是随机的。尽管可以通过简单的说明直观地显示此约束并且易于证明,但它从未出现在文献中。我们可以证明我们的说法令人惊讶,因为它使数十篇先前发表的论文的贡献无效。我们将通过一个定理,说明性示例以及一系列有关重新实现先前工作的实验来证明我们的主张是正确的。尽管我们工作的主要贡献是引起人们的注意,即对一个重要问题的表面解决方案是不正确的,不应再使用,但我们还基于时间序列主题的概念,引入了一种新颖的方法,即能够有意义地将子序列聚类到某些时间序列数据集上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号