首页> 外文期刊>Information Systems >Similarity search of time-warped subsequences via a suffix tree
【24h】

Similarity search of time-warped subsequences via a suffix tree

机译:通过后缀树对时间扭曲的子序列进行相似性搜索

获取原文
获取原文并翻译 | 示例
       

摘要

This paper proposes an indexing technique for fast retrieval of similar subsequences using the time-warping distance. The time-warping distance is a more suitable similarity measure than the Euclidean distance in many applications where sequences may be of different lengths and/or different sampling rates. The proposed indexing technique employs a disk-based suffix tree as an index structure and uses lower-bound distance functions to filter out dissimilar subsequences without false dismissals. To make the index structure compact and hence accelerate the query processing, it converts sequences in the continuous domain into sequences in the discrete domain and stores only a subset of the suffixes whose first values are different from those of the immediately preceding suffixes. Extensive experiments with real and synthetic data sequences revealed that the proposed approach significantly outperforms the sequential scan and LB scan approaches and scales well in a large volume of sequence databases.
机译:本文提出了一种使用时间扭曲距离快速检索相似子序列的索引技术。在许多序列可能具有不同长度和/或不同采样率的应用中,时间扭曲距离是比欧几里得距离更合适的相似性度量。所提出的索引技术采用基于磁盘的后缀树作为索引结构,并使用下限距离函数来筛选出不相似的子序列而不会导致错误解雇。为了使索引结构紧凑,从而加快查询处理速度,它将连续域中的序列转换为离散域中的序列,并仅存储其后缀的第一个值与后一个后缀的值不同的子集。使用真实和合成数据序列进行的大量实验表明,所提出的方法明显优于顺序扫描和LB扫描方法,并且在大量序列数据库中可以很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号