...
首页> 外文期刊>Knowledge and information systems >Beyond one billion time series: indexing and mining very large time series collections with iSAX2+
【24h】

Beyond one billion time series: indexing and mining very large time series collections with iSAX2+

机译:超过十亿个时间序列:使用iSAX2 +索引和挖掘非常大的时间序列集合

获取原文
获取原文并翻译 | 示例
           

摘要

There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to index and mine very large collections of time series. Examples of such applications come from astronomy, biology, the web, and other domains. It is not unusual for these applications to involve numbers of time series in the order of hundreds of millions to billions. However, all relevant techniques that have been proposed in the literature so far have not considered any data collections much larger than one-million time series. In this paper, we describe iSAX 2.0 and its improvements, iSAX 2.0 Clustered and iSAX2+, three methods designed for indexing and mining truly massive collections of time series. We show that the main bottleneck in mining such massive datasets is the time taken to build the index, and we thus introduce a novel bulk loading mechanism, the first of this kind specifically tailored to a time series index. We show how our methods allows mining on datasets that would otherwise be completely untenable, including the first published experiments to index one billion time series, and experiments in mining massive data from domains as diverse as entomology, DNA and web-scale image collections.
机译:通过在不同领域中的多个应用,迫切需要开发能够索引和挖掘大量时间序列的技术。这种应用的示例来自天文学,生物学,网络和其他领域。这些应用涉及数亿至数十亿个数量级的时间序列并不少见。但是,迄今为止,在文献中提出的所有相关技术都没有考虑任何远远超过一百万个时间序列的数据收集。在本文中,我们描述了iSAX 2.0及其改进,iSAX 2.0 Clustered和iSAX2 +,这三种方法用于索引和挖掘真正的大量时间序列。我们显示出挖掘此类海量数据集的主要瓶颈是建立索引所花费的时间,因此我们介绍了一种新颖的批量加载机制,这是第一种专门针对时间序列索引量身定制的机制。我们展示了我们的方法如何允许在否则将是完全站不住脚的数据集上进行挖掘,包括首次发表的索引十亿个时间序列的实验,以及从昆虫学,DNA和网络规模图像收集等领域提取大量数据的实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号