首页> 外文会议>Approximation, randomization, and combinatorial optimization : Algorithms and techniques >Approximating Large Frequency Moments with Pick-and-Drop Sampling
【24h】

Approximating Large Frequency Moments with Pick-and-Drop Sampling

机译:通过拾取和采样来近似大频率矩

获取原文
获取原文并翻译 | 示例

摘要

Given data stream D = {p_1,P_2,…, P_m } of size m of numbers from {1,..., n}, the frequency of i is defined as f_i = |{j : p_j = i}|. The k-th frequency moment of D is defined as F_k = Σ_i~n=_1f_i~k.We consider the problem of approximating frequency moments in insertion-only streams for k ≥ 3. For any constant c we show an O(n~(1~2/k) log(n) log~((c))(n)) upper bound on the space complexity of the problem. Here log~((c)) (n) is the iterative log function. Our main technical contribution is a non-uniform sampling method on matrices. We call our method a pick-and-drop sampling; it samples a heavy element (i.e., element i with frequency Ω(F_k)) with probability Ω(l~(1-2/k)) and gives approximation f_i ≥ (1 — ε)f_i. In addition, the estimations never exceed the real values, that is f_j ≤ f_j for all j. For constant e, we reduce the space complexity of finding a heavy element to O(n~(1-2/k) log(n)) bits. We apply our method of recursive sketches and resolve the problem with O(n~(1-2/k) log(n) log~((c))(n)) bits. We reduce the ratio between the upper and lower bounds from O(log~2(n)) to O(log(n) log~((c)) (n)). Thus, we provide a (roughly) quadratic improvement of the result of Andoni, Krauthgamer and Onak (FOCS 2011).
机译:给定来自{1,...,n}的数字大小为m的数据流D = {p_1,P_2,…,P_m},i的频率定义为f_i = | {j:p_j = i} |。 D的第k个频率矩定义为F_k =Σ_i〜n = _1f_i〜k。我们考虑了对于k≥3的仅插入流中的频率矩近似的问题。对于任何常数c,我们都表示O(n〜 (1〜2 / k)log(n)log〜((c))(n))上限是问题的空间复杂度。这里log〜((c))(n)是迭代对数函数。我们的主要技术贡献是对矩阵的非均匀采样方法。我们称我们的方法为拖放采样;它以概率Ω(l / n〜(1-2 / k))对重元素(即频率为Ω(F_k)的元素i)进行采样,并得出近似值f_i≥(1ε)f_i。此外,估计值永远不会超过实际值,即所有j的f_j≤f_j。对于常数e,我们将找到重元素的空间复杂度降低到O(n〜(1-2 / k)log(n))位。我们应用递归草图的方法,并用O(n〜(1-2 / k)log(n)log〜((c))(n))位解决问题。我们将上限和下限之间的比率从O(log〜2(n))减少到O(log(n)log〜((c))(n))。因此,我们对Andoni,Krauthgamer和Onak的结果进行了(大致)二次改进(FOCS 2011)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号