首页> 外文会议>International Conference on Algorithmic Learning Theory >Multiple Pass Streaming Algorithms for Learning Mixtures of Distributions in mathbb RdRd
【24h】

Multiple Pass Streaming Algorithms for Learning Mixtures of Distributions in mathbb RdRd

机译:用于学习分布混音的多遍流算法 MathBB RDRD

获取原文

摘要

We present a multiple pass streaming algorithm for learning the density function of a mixture of k uniform distributions over rectangles (cells) in , for any d > 0. Our learning model is: samples drawn according to the mixture are placed in arbitrary order in a data stream that may only be accessed sequentially by an algorithm with a very limited random access memory space. Our algorithm makes 2 + 1 passes, for any > 0, and requires memory at most . This exhibits a strong memory-space tradeoff: a few more passes significantly lowers its memory requirements, thus trading one of the two most important resources in streaming computation for the other. Chang and Kannan ? first considered this problem for [1] d = 1, 2. Our learning algorithm is especially appropriate for situations where massive data sets of samples are available, but practical computation with such large inputs requires very restricted models of computation.
机译:我们介绍了一种多遍流算法,用于学习k均匀分布在矩形(细胞)中的混合物的密度函数,对于任何D> 0.我们的学习模型是:根据混合物绘制的样品以任意顺序置于a中可以仅通过具有非常有限的随机存取存储器空间算法顺序访问的数据流。我们的算法使2 + 1传递给任何> 0,并且最多需要存储器。这展示了强大的记忆空间权衡:更多的通过显着降低了其内存要求,因此交易了另一个最重要的资源之一。张和肯纳?首先考虑了[1] D = 1,2的这个问题。我们的学习算法特别适用于具有大规模数据集可用的情况,但是具有如此大输入的实际计算需要非常受限的计算模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号