首页> 外文会议>International Conference on Inductive Logic Programming;ILP 2007 >An Efficient Similarity Searching Algorithm Based on Clustering for Time Series
【24h】

An Efficient Similarity Searching Algorithm Based on Clustering for Time Series

机译:基于时间序列聚类的高效相似度搜索算法

获取原文

摘要

Indexing large time series databases is crucial for efficient searching of time series queries. In the paper, we propose a novel indexing scheme RQI (Range Query based on Index) which includes three filtering methods: first-k filtering, indexing lower bounding and upper bounding as well as triangle inequality pruning. The basic idea is calculating wavelet coefficient whose first k coefficients are used to form a MBR (minimal bounding rectangle) based on haar wavelet transform for each time series and then using point filtering method; At the same time, lower bounding and upper bounding feature of each time series is calculated, in advance, and stored into index structure. At last, triangle inequality pruning method is used by calculating the distance between time series beforehand. Then we introduce a novel lower bounding distance function SLBS (Symmetrical Lower Bounding based on Segment) and a novel clustering algorithm CSA (Clustering based on Segment Approximation) in order to further improve the search efficiency of point filtering method by keeping a good clustering trait of index structure. Extensive experiments over both synthetic and real datasets show that our technologies provide perfect pruning power and could obtain an order of magnitude performance improvement for time series queries over traditional naive evaluation techniques.
机译:索引大型时间序列数据库对于有效搜索时间序列查询至关重要。在本文中,我们提出了一种新颖的索引方案RQI(基于索引的范围查询),该方案包括三种过滤方法:first-k过滤,下限和上限索引以及三角形不等式修剪。基本思想是基于每个时间序列的haar小波变换计算小波系数,其前k个系数用于形成MBR(最小边界矩形),然后使用点滤波方法;同时,预先计算每个时间序列的下界和上界特征,并存储到索引结构中。最后,通过预先计算时间序列之间的距离,采用三角不等式修剪方法。然后介绍一种新颖的下界距离函数SLBS(基于分段的对称下界)和一种新颖的聚类算法CSA(基于分段近似的聚类),以通过保持良好的聚类特性进一步提高点过滤方法的搜索效率。索引结构。在合成数据集和真实数据集上的大量实验表明,我们的技术提供了完美的修剪能力,并且与传统的朴素评估技术相比,可以为时间序列查询提供一个数量级的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号