首页> 外文期刊>Data mining and knowledge discovery >Introducing time series snippets: a new primitive for summarizing long time series
【24h】

Introducing time series snippets: a new primitive for summarizing long time series

机译:引入时间序列片段:总结长时间序列的新原子

获取原文
获取原文并翻译 | 示例
       

摘要

The first question a data analyst asks when confronting a new dataset is often, "Show me some representative/typical data." Answering this question is simple in many domains, with random samples or aggregate statistics of some kind. Surprisingly, it is difficult for large time series datasets. The major difficulty is not time or space complexity, but defining what it means to berepresentativedata for this data type. In this work, we show that the obvious candidate definitions: motifs, shapelets, cluster centers, random samples etc., are all poor choices. We introducetime series snippets, a novel representation of typical time series subsequences. Informally, time series snippets can be seen as the answer to the following question. If a user, which could be a human or a higher-level algorithm, only has resources (including human time) to inspectksubsequences of a long time series, whichksubsequences should be chosen? Beyond their utility for visualizing and summarizing massive time series collections, we show that time series snippets have utility for high-level comparison of large time series collections.
机译:数据分析师的第一个问题通常会询问新数据集通常,“向我展示一些代表性/典型数据。”在许多域中回答这个问题很简单,有一个随机的样本或某种聚合统计。令人惊讶的是,大型时间序列数据集很难。主要困难不是时间或空间复杂性,而是定义它对这种数据类型的BerePresentatatata的意义意味着什么。在这项工作中,我们表明明显的候选定义:图案,翻领,群集中心,随机样品等都是差的选择。我们介绍了系列片段,这是一个新颖的典型时间序列子序列的表示。非正式地,时间序列片段可以被视为以下问题的答案。如果用户可以是人类或更高级别的算法,则仅具有资源(包括人为时间)来验证长时间序列的序列,应该选择该序列序列?除了用于可视化和总结大量时间序列收集的实用程序之外,我们展示了时间序列片段具有大型时间序列集合的高级比较的效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号