首页> 外文会议>International Conference on Software and Data Technologies >TOWARDS A FASTER SYMBOLIC AGGREGATE APPROXIMATION METHOD
【24h】

TOWARDS A FASTER SYMBOLIC AGGREGATE APPROXIMATION METHOD

机译:朝向更快的符号综合估计方法

获取原文

摘要

The similarity search problem is one of the main problems in time series data mining. Traditionally, this problem was tackled by sequentially comparing the given query against all the time series in the database, and returning all the time series that are within a predetermined threshold of that query. But the large size and the high dimensionality of time series databases that are in use nowadays make that scenario inefficient. There are many representation techniques that aim at reducing the dimensionality of time series so that the search can be handled faster at a lower-dimensional space level. The symbolic aggregate approximation (SAX) is one of the most competitive methods in the literature. In this paper we present a new method that improves the performance of SAX by adding to it another exclusion condition that increases the exclusion power. This method is based on using two representations of the time series: one of SAX and the other is based on an optimal approximation of the time series. Pre-computed distances are calculated and stored offline to be used online to exclude a wide range of the search space using two exclusion conditions. We conduct experiments which show that the new method is faster than SAX.
机译:相似性搜索问题是时间序列数据挖掘中的主要问题之一。传统上,通过顺序地将给定查询与数据库中的所有时间序列进行比较,并返回在该查询的预定阈值内的所有时间序列中来解决这个问题。但是,现在使用的时间序列数据库的大尺寸和高度,使得这种情况效率低下。存在许多表示技术,其旨在减小时间序列的维度,以便可以在低维空间水平下更快地处理搜索。符号聚合近似(SAX)是文献中最具竞争力的方法之一。在本文中,我们介绍了一种新方法,通过添加到增加排除功率的另一个排除条件来提高SAX的性能。该方法基于使用时间序列的两个表示:SAX之一,另一个是基于时间序列的最佳逼近。计算预先计算的距离并将脱机存储在线以使用两个排除条件来排除各种搜索空间。我们进行实验,表明新方法比SAX更快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号