首页> 外文期刊>Knowledge and information systems >L-BiX: incremental sliding-window aggregation over data streams using linear bidirectional aggregating indexes
【24h】

L-BiX: incremental sliding-window aggregation over data streams using linear bidirectional aggregating indexes

机译:L-BIX:使用线性双向聚合索引对数据流的增量滑动窗口聚合

获取原文
获取原文并翻译 | 示例
           

摘要

The number of real-time information sources, or so-called streams, has rapidly increased, leading to a greater demand for complex analyses over streams. Although many stream analysis methods exist, aggregation is fundamental to ascertain higher levels of knowledge from raw data. In particular, sliding-window aggregation, where aggregations over sliding windows are repeatedly computed, is useful in many real-life applications. Two stacks is the state-of-the-art method to compute sliding-window aggregations incrementally with a O(1) time complexity. However, its performance seriously degrades as the window size increases due to the high overhead to maintain its index. To address this problem, this paper proposes a linear bidirectional index (L-BiX) that exploits two kinds of partial aggregations. Specifically, forward (old-to-new) and backward (new-to-old) aggregations allow efficient computations in an incremental manner. The proposed algorithm requires the same time complexity as two stacks (O(1)). Our experimental evaluation shows that the throughput of L-BiX can be faster by up to 1.71 times than that of two stacks with a 50% smaller memory footprint.
机译:实时信息源或所谓的流的数量迅速增加,导致对流的复杂分析更大的需求。虽然存在许多流分析方法,但聚合是从原始数据确定更高的知识水平的基础。特别是,在许多真实应用中,重复计算滑动窗口聚合,其中在滑动窗口上通过滑动窗口的聚合。两个堆栈是最先进的方法,以逐步计算逐渐计算滑动窗口聚合,以o(1)时间复杂度。然而,由于窗口大小由于高开销而增加,其性能严重降低,以保持其指标。为了解决这个问题,本文提出了利用两种部分聚合的线性双向指数(L-BIX)。具体而言,向前(旧的)和后退(新的)聚合允许以增量方式提供有效的计算。所提出的算法需要与两个堆叠相同的时间复杂度(O(1))。我们的实验评价结果表明,L-BIX的吞吐量可以比两堆叠的吞吐量更快,更快的2.71倍,内存占用50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号