首页> 外文会议> >Duality-based subsequence matching in time-series databases
【24h】

Duality-based subsequence matching in time-series databases

机译:时间序列数据库中基于对偶的子序列匹配

获取原文
获取外文期刊封面目录资料

摘要

The authors propose a subsequence matching method, Dual Match, which exploits duality in constructing windows and significantly improves performance. Dual Match divides data sequences into disjoint windows and the query sequence into sliding windows, and thus, is a dual approach of the one by C. Faloutsos et al. (1994), which divides data sequences into sliding windows and the query sequence into disjoint windows. We formally prove that our dual approach is correct, i.e., it incurs no false dismissal. We also prove that, given the minimum query length, there is a maximum bound of the window size to guarantee correctness of Dual Match and discuss the effect of the window size on performance. FRM causes a lot of false alarms by storing minimum bounding rectangles rather than individual points representing windows to avoid excessive storage space required for the index. Dual Match solves this problem by directly storing points, but without incurring excessive storage overhead. Experimental results show that, in most cases, Dual Match provides large improvement in both false alarms and performance over FRM, given the same amount of storage space. In particular, for low selectivities (less than 10/sup -4/), Dual Match significantly improves performance up to 430-fold. On the other hand, for high selectivities(more than 10/sup -2/), it shows a very minor degradation (less than 29%). For selectivities in between (10/sup -4//spl sim/10/sup -2/), Dual Match shows performance slightly better than that of FRM. Dual Match is also 4.10/spl sim/25.6 times faster than FRM in building indexes of approximately the same size. Overall, these results indicate that our approach provides a new paradigm in subsequence matching that improves performance significantly in large database applications.
机译:作者提出了一种后续匹配方法,双重匹配,它在构建Windows中利用二元性并显着提高性能。双重匹配将数据序列划分为不相交的窗口和查询序列转换为滑动窗口,因此是C. faloutsos等人的双方法。 (1994),将数据序列划分为滑动窗口和查询序列分成不相交的窗口。我们正式证明我们的双重方法是正确的,即,它不会被错误解雇。我们还证明,鉴于最小查询长度,窗口大小有最大限度的界限,以保证双匹配的正确性,并讨论窗口大小对性能的影响。 FRM通过存储最小边界矩形而不是表示窗口的单个点来引起大量误报,以避免索引所需的过度存储空间。双重匹配通过直接存储点来解决此问题,但不会产生过多的存储开销。实验结果表明,在大多数情况下,双匹配在鉴于相同数量的存储空间,双重匹配在FRM上的误报和性能方面提供了大量的改进。特别是,对于低选择性(小于10 / sup -4 /),双匹配显着提高了430倍的性能。另一方面,对于高选择性(超过10 / sup -2 /),它显示出非常轻微的降解(小于29%)。对于在(10 / SUP-4 // SPL SIM / 10 / SUP-2 /)之间的选择性,双匹配显示性能略高于FRM。双匹配也比FRM在大致相同的尺寸大致相同的索引中的4.10 / SPL SIM / 25.6倍。总的来说,这些结果表明,我们的方法在随后匹配中提供了一种新的范例,这在大型数据库应用程序中提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号