【24h】

Mining motifs in massive time series databases

机译:大量时间序列数据库中的挖掘主题

获取原文
获取原文并翻译 | 示例

摘要

The problem of efficiently locating previously known patterns in a time series database (i.e., query by content) has received much attention and may now largely be regarded as a solved problem. However, from a knowledge discovery viewpoint, a more interesting problem is the enumeration of previously unknown, frequently occurring patterns. We call such patterns "motifs", because of their close analogy to their discrete counterparts in computation biology. An efficient motif discovery algorithm for time series would be useful as a tool for summarizing and visualizing massive time series databases. In addition it could be used as a subroutine in various other data mining tasks, including the discovery of association rules, clustering and classification. In this paper we carefully motivate, then introduce, a nontrivial definition of time series motifs. We propose an efficient algorithm to discover them, and we demonstrate the utility and efficiency of our approach on several real world datasets.
机译:有效地在时间序列数据库中定位先前已知的模式的问题(即,按内容查询)已经引起了很多关注,并且现在可以在很大程度上被认为是解决的问题。但是,从知识发现的角度来看,一个更有趣的问题是枚举以前未知的,频繁发生的模式。我们称这种模式为“基元”,因为它们与计算生物学中的离散基元非常相似。一种有效的时间序列主题发现算法,可用作汇总和可视化大量时间序列数据库的工具。此外,它还可以用作其他各种数据挖掘任务的子例程,包括发现关联规则,聚类和分类。在本文中,我们仔细地激励,然后介绍时间序列主题的非平凡定义。我们提出了一种有效的算法来发现它们,并在一些现实世界的数据集上证明了我们方法的实用性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号