首页> 外文会议>International Workshop on Randomization and Computation >Approximating Large Frequency Moments with Pick-and-Drop Sampling
【24h】

Approximating Large Frequency Moments with Pick-and-Drop Sampling

机译:用拾取和丢弃采样近似大频率矩

获取原文

摘要

Given data stream D = {p_1, p_2,..., p_m} of size m of numbers from {1,..., n}, the frequency of i is defined as f_i = |{j : p_j = i}|. The k-th frequency moment of D is defined as F_k = ∑_(i=1)~n f_i~k. We consider the problem of approximating frequency moments in insertion-only streams for k ≥ 3. For any constant c we show an O(n~(1-2/k) log(n) log~(c) (n)) upper bound on the space complexity of the problem. Here log~(c) (n) is the iterative log function. Our main technical contribution is a non-uniform sampling method on matrices. We call our method a pick-and-drop sampling; it samples a heavy element (i.e., element i with frequency Ω(F_k)) with probability Ω(1/n~(1-2/k)) and gives approximation f_i ≥ (1 - ∈)f_i. In addition, the estimations never exceed the real values, that is f_j ≤ f_j for all j. For constant ∈, we reduce the space complexity of finding a heavy element to O(n~(1-2/k) log(n)) bits. We apply our method of recursive sketches and resolve the problem with O(n~(1-2/k) log(n) log~(c) (n)) bits. We reduce the ratio between the upper and lower bounds from O(log~2(n)) to O(log(n) log~(c) (n)). Thus, we provide a (roughly) quadratic improvement of the result of Andoni, Krauthgamer and Onak (FOCS 2011).
机译:给定数据流d = {p_1,p_2,...,p_m}从{1,...,n}的数量的大小m,i的频率被定义为f_i = | {j:p_j = i} | 。 d的k频矩被定义为f_k =Σ_(i = 1)〜n f_i〜k。我们考虑近似于k≥3的插入流中近似频率矩的问题。对于任何常数c,我们显示O(n〜(1-2 / k)log(n)log〜(c)(n))上部绑定问题的空间复杂性。这里日志〜(c)(n)是迭代日志函数。我们的主要技术贡献是矩阵上的非均匀采样方法。我们称我们的方法是一个挑选的抽样;它采样具有概率ω(1 / N〜(1-2 / k))的沉重元素(即,用频率ω(f_k)),并提供近似f_i≥(1 - ∈)f_i。此外,估计永远不会超过真实值,即所有j的f_j≤f_j。对于常数∈,我们降低了向O(n〜(1-2 / k)日志(n))比特找到沉重元素的空间复杂性。我们应用我们的递归草图的方法,并解决O(n〜(1-2 / k)log(n)log〜(c)(n))比特的问题。我们将上限和下限与O(log〜2(n))到O(log(n)log〜(c)(n))之间的比率降低。因此,我们提供了(粗略地)的二次改善了Andoni,Krauthgamer和Onak的结果(Focs 2011)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号