...
首页> 外文期刊>ACM Transactions on Computational Theory >The Value of Multiple Read/Write Streams for Approximating Frequency Moments
【24h】

The Value of Multiple Read/Write Streams for Approximating Frequency Moments

机译:近似频率矩的多个读/写流的值

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We consider the read/write streams model, an extension of the standard data stream model in which an algorithm can create and manipulate multiple read/write streams in addition to its input data stream. Like the data stream model, the most important parameter for this model is the amount of internal memory used by such an algorithm. The other key parameters are the number of streams the algorithm uses and the number of passes it makes on these streams. We consider how the addition of multiple streams impacts the ability of algorithms to approximate the frequency moments of the input stream. We show that any randomized read/write stream algorithm with a fixed number of streams and a sublog-arithmic number of passes that produces a constant factor approximation of the k-th frequency moment F_R, of an input sequence of length of at most N from (1, .... N] requires space Ω(N~(1-4/R-δ) for any δ > 0. For comparison, it is known that with a single read-only one-pass data stream there is a randomized constant-factor approximation for F_k using O(N~(1-2/R)) space, and that by sorting, which can be done deterministically in O(log N passes using 3 read/write streams, F_k can be computed exactly. Therefore, although the ability to manipulate multiple read/write streams can add substantial power to the data stream model, with a sublogarithmic number of passes this does not significantly improve the ability to approximate higher frequency moments efficiently. We obtain our results by showing a new connection between the read/write streams model and the multiparty number-in-hand communication model.
机译:我们考虑读/写流模型,它是标准数据流模型的扩展,在该模型中,除输入数据流之外,算法还可以创建和操纵多个读/写流。像数据流模型一样,此模型最重要的参数是该算法使用的内部内存量。其他关键参数是算法使用的流数及其对这些流的通过次数。我们考虑添加多个流如何影响算法近似输入流的频率矩的能力。我们表明,具有固定数量的流和次对数的通过次数的任何随机读/写流算法,都会产生第k个频率矩F_R的常数因子近似值,该序列的输入序列的最大长度为N (1,.... N]对于任何δ> 0都需要空间Ω(N〜(1-4 /R-δ)。为了进行比较,已知对于单个只读单程数据流,存在使用O(N〜(1-2 / R))空间对F_k进行随机化的常数因子近似,并通过排序(可以使用3个读/写流在O(log N次通过)中确定性地进行)来计算F_k因此,尽管操纵多个读/写流的能力可以为数据流模型增加相当大的功能,但通过次对数传递并不能显着提高有效地近似较高频率矩的能力。读/写流模型与多方现有号码公共之间的新连接糖化模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号