首页> 外文会议>Balkan Conference in Informatics >Cluster-Based Similarity Search in Time Series
【24h】

Cluster-Based Similarity Search in Time Series

机译:基于群集的相似性搜索时间序列

获取原文

摘要

In this paper, we present a new method that accelerates similarity search implemented via one-nearest neighbor on time series data. The main idea is to identify the most similar time series to a given query without necessarily searching over the whole database. Our method is based on partitioning the search space by applying the K-means algorithm on the data. Then, similarity search is performed hierarchically starting from the cluster that lies most closely to the query. This procedure aims at reaching the most similar series without searching all clusters. In this work, we propose to reduce the intrinsically high dimensionality of time series prior to clustering by applying a well known dimensionality reduction technique, namely, the Piecewise Aggregate Approximation, for its simplicity and efficiency. Experiments are conducted on twelve real-world and synthetic datasets covering a wide range of applications.
机译:在本文中,我们介绍了一种新的方法,可以在时间序列数据上通过一个最接近邻居实现的相似性搜索。主要思想是将最相似的时间序列识别到给定查询,而不需要搜索整个数据库。我们的方法是基于通过在数据上应用K-Means算法来划分搜索空间。然后,相似搜索从群集地从群集地开始,最接近查询。此程序旨在达到最相似的系列而不搜索所有集群。在这项工作中,我们建议通过应用众所周知的维度减少技术,即分段含量近似,以降低聚类前的时间序列的内在高度维度。实验是在十二个现实世界和合成数据集中进行的,涵盖各种应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号