首页> 外文会议>International conference on very large data bases >Upper and Lower Bounds on the Cost of a Map-Reduce Computation
【24h】

Upper and Lower Bounds on the Cost of a Map-Reduce Computation

机译:上限和下限造成地图减少计算的成本

获取原文

摘要

In this paper we study the tradeoff between parallelism and communication cost in a map-reduce computation. For any problem that is not "embarrassingly parallel," the finer we partition the work of the reducers so that more parallelism can be extracted, the greater will be the total communication between mappers and reducers. We introduce a model of problems that can be solved in a single round of map-reduce computation. This model enables a generic recipe for discovering lower bounds on communication cost as a function of the maximum number of inputs that can be assigned to one reducer. We use the model to analyze the tradeoff for three problems: finding pairs of strings at Hamming distance d, finding triangles and other patterns in a larger graph, and matrix multiplication. For finding strings of Hamming distance 1, we have upper and lower bounds that match exactly. For triangles and many other graphs, we have upper and lower bounds that are the same to within a constant factor. For the problem of matrix multiplication, we have matching upper and lower bounds for one-round map-reduce algorithms. We are also able to explore two-round map-reduce algorithms for matrix multiplication and show that these never have more communication, for a given reducer size, than the best one-round algorithm, and often have significantly less.
机译:在本文中,我们在地图减少计算中研究了并行性和通信成本之间的权衡。对于任何不“令人尴尬的并行”的问题,我们将降低减速器的工作,以便可以提取更多并行性,映射器和减速器之间的总通信越大。我们介绍了一个问题的模型,可以在一轮地图上减少计算中解决。该模型使通用配方能够在通信成本上发现较低限制的函数,作为可以分配给一个减速器的最大输入数的函数。我们使用模型来分析三个问题的权衡:在汉明距离D中查找串的串,在更大的图表中找到三角形和其他模式,以及矩阵乘法。为了找到汉明距离1的琴弦,我们有搭配的上下界限。对于三角形和许多其他图表,我们的上限和下限与恒定因子相同。对于矩阵乘法的问题,我们对一轮映射缩小算法匹配的上限和下限。我们还能够探索两个用于矩阵乘法的地图 - 减少算法,并显示这些从未有更多的通信,因为给定的减速器大小,而不是最佳的单级算法,并且通常具有显着较少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号