【24h】

Distributed Streams Algorithms for Sliding Windows

机译:滑动Windows的分布式流算法

获取原文
获取原文并翻译 | 示例

摘要

This paper presents algorithms for estimating aggregate functions over a "sliding window" of the N most recent data items in one or more streams. Our results include: 1. For a single stream, we present the first ε-approxima-tion scheme for the number of 1's in a sliding window that is optimal in both worst case time and space. We also present the first ε-approximation scheme for the sum of integers in [0..R] in a sliding window that is optimal in both worst case time and space (assuming R is at most polynomial in N). Both algorithms axe deterministic and use only logarithmic memory words. 2. In contrast, we show that any deterministic algorithm that estimates, to within a small constant relative error, the number of 1's (or the sum of integers) in a sliding window over the union of distributed streams requires Ω(N) space. 3. We present the first randomized (ε, δ)-approximation scheme for the number of 1's in a sliding window over the union of distributed streams that uses only logarithmic memory words. We also present the first (ε, δ)-approximation scheme for the number of distinct values in a sliding window over distributed streams that uses only logarithmic memory words. Our results are obtained using a novel family of synopsis data structures which we call waves.
机译:本文提出了一种算法,用于估算一个或多个流中N个最新数据项的“滑动窗口”上的聚合函数。我们的结果包括:1.对于单个流,我们针对滑动窗口中的1的数量提出了第一种ε近似方案,该方案在最坏情况下的时间和空间均是最佳的。我们还针对滑动窗口中[0..R]中的整数和提出了第一种ε近似方案,该方案在最坏情况下的时间和空间均是最佳的(假设R最多是N的多项式)。两种算法都是确定性的,并且仅使用对数存储字。 2.相反,我们表明,要在较小的恒定相对误差内估算分布式流联合上的滑动窗口中1的数量(或整数之和)的确定性算法都需要Ω(N)空间。 3.我们提出了第一种随机化(ε,δ)近似方案,用于仅使用对数存储字的分布式流联合上的滑动窗口中1的数目。我们还针对仅使用对数存储字的分布式流上的滑动窗口中的不同值的数量,提出了第一种(ε,δ)近似方案。我们的结果是使用新的概要数据结构家族(称为波)获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号