...
首页> 外文期刊>Data & Knowledge Engineering >Approximating sliding windows by cyclic tree-like histograms for efficient range queries
【24h】

Approximating sliding windows by cyclic tree-like histograms for efficient range queries

机译:通过循环树状直方图近似滑动窗口以进行有效的范围查询

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The issue of providing fast approximate answers to range queries on sliding windows with a small consumption of storage space is one of the main challenges in the context of data streams. On the one hand, the importance of this class of queries is widely accepted. They are indeed useful to compute aggregate information over the data stream, allowing us to extract from it more abstract knowledge than point queries. On the other hand, the usage of techniques like synopses based on histograms, sketches, sampling, and so on, makes effective those approaches which require multiple scans on data, which otherwise would be prohibitive from the computational point of view. Among the above techniques, histogram-based approaches are considered one of the most advantageous solutions, at least in case of range queries. It is a matter of fact that histograms show a very good capability of summarizing data preserving quick and accurate answers to range queries. In this paper, we propose a novel histogram-based technique to reduce sliding windows supporting approximate arbitrary range-sum queries. Our histogram, relying on a tree-based structure, is suitable to directly support hierarchical queries and, thus, drill-down and roll-up operations. In addition, the structure well supports sliding window shifting and quick query answering, since it operates in logarithmic time in the sliding window size. A bit-saving approach to encoding tree nodes allows us to compress the sliding window with a little price in terms of accuracy. The contribution of this work is thus not only the proposal of a new specific technique to tackle an important problem but also a deep analysis of the advantages given by the hierarchical approach combined with the bit-saving strategy. A careful experimental analysis validates the method showing its superiority w.r.t. the state of the art.
机译:在不占用存储空间的情况下为滑动窗口上的范围查询提供快速近似答案的问题是数据流上下文中的主要挑战之一。一方面,这类查询的重要性已被广泛接受。它们确实对计算数据流上的聚合信息很有用,从而使我们可以从中提取比点查询更多的抽象知识。另一方面,诸如基于直方图,草图,采样等的提要之类的技术的使用使那些需要对数据进行多次扫描的方法变得有效,否则从计算的角度来看将是禁止的。在以上技术中,至少在范围查询的情况下,基于直方图的方法被认为是最有利的解决方案之一。事实是,直方图显示出很好的汇总数据的能力,可以为范围查询保留快速而准确的答案。在本文中,我们提出了一种新颖的基于直方图的技术来减少支持近似任意范围和查询的滑动窗口。我们的直方图依赖于基于树的结构,适合直接支持分层查询,因此可以进行向下和向上滚动操作。另外,该结构很好地支持滑动窗口移动和快速查询回答,因为它以滑动窗口大小的对数时间运行。一种节省代码的树节点编码方法使我们能够以较低的准确性压缩滑动窗口。因此,这项工作的贡献不仅是解决特定重要问题的新技术的建议,而且是对分层方法与位节省策略相结合的优点的深入分析。仔细的实验​​分析验证了该方法的优越性。最先进的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号