首页> 外文学位 >Subseries join and compression of time series data based on non-uniform segmentation.
【24h】

Subseries join and compression of time series data based on non-uniform segmentation.

机译:基于非均匀分段的子序列联接和时间序列数据的压缩。

获取原文
获取原文并翻译 | 示例

摘要

A time series is composed of a sequence of data items that are measured at uniform intervals. Many application areas generate or manipulate time series, including finance, medicine, digital audio, and motion capture. Efficiently searching a large time series database is still a challenging problem, especially when partial or subseries matches are needed.;To empirically evaluate their effectiveness, the proposed subseries join and compression techniques are tested on various publicly available datasets. A large motion capture database is also used to verify the techniques in a real-world application. The experiments show that the proposed subseries join technique can better tolerate noise and local scaling than previous work, and the proposed compression technique can also achieve about 85% higher compression rates than previous work with the same distortion error.;This thesis proposes a new definition of subseries join, a symmetric generalization of subseries matching, which finds similar subseries in two or more time series datasets. A solution is proposed to compute the subseries join based on a hierarchical feature representation. This hierarchical feature representation is generated by an anisotropic diffusion scale-space analysis and a non-uniform segmentation method. Each segment is represented by a minimal polynomial envelope in a reduced-dimensionality space. Based on the hierarchical feature representation, all features in a dataset are indexed in an R-tree, and candidate matching features of two datasets are found by an R-tree join operation. Given candidate matching features, a dynamic programming algorithm is developed to compute the final subseries join. To improve storage efficiency, a hierarchical compression scheme is proposed to compress features. The minimal polynomial envelope representation is transformed to a Bezier spline envelope representation. The control points of each Bezier spline are then hierarchically differenced and an arithmetic coding is used to compress these differences.
机译:时间序列由以均匀间隔测量的一系列数据项组成。许多应用领域会生成或操纵时间序列,包括财务,医学,数字音频和运动捕捉。有效地搜索大型时间序列数据库仍然是一个具有挑战性的问题,尤其是在需要部分或子序列匹配的情况下。为了实证评估它们的有效性,在各种公开可用的数据集上对提出的子序列联接和压缩技术进行了测试。大型运动捕捉数据库也用于验证实际应用中的技术。实验表明,所提出的子序列连接技术比以前的工作能够更好地容忍噪声和局部缩放,并且在具有相同失真误差的情况下,所提出的压缩技术还可以实现比以前的工作高约85%的压缩率。子系列联接的一种,子系列匹配的对称概括,可以在两个或多个时间序列数据集中找到相似的子系列。提出了一种基于层次特征表示来计算子系列联接的解决方案。通过各向异性扩散尺度空间分析和非均匀分割方法生成此分层特征表示。每个片段由降维空间中的最小多项式包络表示。基于分层特征表示,数据集中的所有特征都在R树中建立索引,并且两个数据集的候选匹配特征通过R树连接操作找到。给定候选匹配特征,开发了一种动态编程算法来计算最终的子系列连接。为了提高存储效率,提出了一种分层压缩方案来压缩特征。最小多项式包络表示被转换为贝塞尔样条曲线包络表示。然后,每个Bezier样条曲线的控制点在层次上有所不同,并且使用算术编码来压缩这些差异。

著录项

  • 作者

    Lin, Yi.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 166 p.
  • 总页数 166
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:37:52

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号