【24h】

ISAX: Indexing and Mining Terabyte Sized Time Series

机译:ISAX:索引和挖掘TB级时间序列

获取原文

摘要

Current research in indexing and mining time series data has produced many interesting algorithms and representations. However, the algorithms and the size of data considered have generally not been representative of the increasingly massive datasets encountered in science, engineering, and business domains. In this work, we show how a novel multi-resolution symbolic representation can be used to index datasets which are several orders of magnitude larger than anything else considered in the literature. Our approach allows both fast exact search and ultra fast approximate search. We show how to exploit the combination of both types of search as sub-routines in data mining algorithms, allowing for the exact mining of truly massive real world datasets, containing millions of time series.
机译:当前在索引和挖掘时间序列数据方面的研究已经产生了许多有趣的算法和表示形式。但是,所考虑的算法和数据大小通常不能代表科学,工程和业务领域中日益庞大的数据集。在这项工作中,我们展示了如何使用新颖的多分辨率符号表示法来索引比文献中所考虑的其他任何方法都大几个数量级的数据集。我们的方法允许快速精确搜索和超快速近似搜索。我们将展示如何在数据挖掘算法中利用这两种类型的搜索作为子例程的组合,从而允许对包含数百万个时间序列的真正庞大的真实世界数据集进行精确挖掘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号