首页> 外文会议>International conference on very large data bases;VLDB 2008 >On Efficiently Searching Trajectories and Archival Data for Historical Similarities
【24h】

On Efficiently Searching Trajectories and Archival Data for Historical Similarities

机译:有效地搜索轨迹和档案数据的历史相似性

获取原文

摘要

We study the problem of efficiently evaluating similarity queries on histories, where a history is a d-dimensional time series for d ≥ 1. While there are some solutions for time-series and spatio-temporal trajectories where typically d ≤ 3, we are not aware of any work that examines the problem for larger values of d. In this paper, we address the problem in its general case and propose a class of summaries for histories with a few interesting properties. First, for commonly used distance functions such as the L_p-norm, LCSS, and DTW, the summaries can be used to efficiently prune some of the histories that cannot be in the answer set of the queries. Second, histories can be indexed based on their summaries, hence the qualifying candidates can be efficiently retrieved. To further reduce the number of unnecessary distance computations for false positives, we propose a finer level approximation of histories, and an algorithm to find an approximation with the least maximum distance estimation error. Experimental results confirm that the combination of our feature extraction approaches and the indexability of our summaries can improve upon existing methods and scales up for larger values of d and database sizes, based on our experiments on real and synthetic datasets of 17-dimensional histories.
机译:我们研究有效地评估历史的相似性查询的问题,其中历史是d≥1的d维时间序列。尽管对于时间序列和时空轨迹有一些解决方案,通常d≤3,但我们不是知道有任何工作可以检查d的较大值的问题。在本文中,我们在一般情况下解决该问题,并提出了一些具有一些有趣特性的历史摘要。首先,对于常用的距离函数(例如L_p-norm,LCSS和DTW),摘要可用于有效地修剪查询答案集中无法包含的某些历史记录。其次,历史可以基于摘要进行索引,因此可以有效地检索符合条件的候选人。为了进一步减少误报不必要的距离计算的数量,我们提出了一个更精细的历史近似值,以及一种算法,以找到最小的最大距离估计误差。实验结果证实,基于我们对17维历史记录的真实数据集和合成数据集进行的实验,我们的特征提取方法与摘要的可索引性的结合可以改善现有方法,并扩大d值和数据库大小,以获取更大的d值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号