首页> 外文OA文献 >On the design and implementation of broadcast and global combine operations using the postal model
【2h】

On the design and implementation of broadcast and global combine operations using the postal model

机译:论邮政模型的广播和全球结合运算的设计与实现

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

There are a number of models that were proposed in recent years for message passing parallel systems. Examples are the postal model and its generalization the LogP model. In the postal model a parameter λ is used to model the communication latency of the message-passing system. Each node during each round can send a fixed-size message and, simultaneously, receive a message of the same size. Furthermore, a message sent out during round r will incur a latency of hand will arrive at the receiving node at round r + λ - 1. ududOur goal in this paper is to bridge the gap between the theoretical modeling and the practical implementation. In particular, we investigate a number of practical issues related to the design and implementation of two collective communication operations, namely, the broadcast operation and the global combine operation. Those practical issues include, for example, 1) techniques for measurement of the value of λ on a given machine, 2) creating efficient broadcast algorithms that get the latency hand the number of nodes n as parameters and 3) creating efficient global combine algorithms for parallel machines with λ which is not an integer. We propose solutions that address those practical issues and present results of an experimental study of the new algorithms on the Intel Delta machine. Our main conclusion is that the postal model can help in performance prediction and tuning, for example, a properly tuned broadcast improves the known implementation by more than 20%.
机译:近年来,针对消息传递并行系统提出了许多模型。例如邮政模型及其对LogP模型的概括。在邮政模型中,参数λ用于对消息传递系统的通信延迟进行建模。每个回合期间的每个节点都可以发送固定大小的消息,并同时接收相同大小的消息。此外,在回合r期间发出的消息将导致手等待时间到达回合r +λ-1处的接收节点。 ud ud本文的目标是弥合理论模型与实际实现之间的差距。 。特别是,我们调查了与两个集体通信操作(即广播操作和全局合并操作)的设计和实现有关的许多实际问题。这些实际问题包括,例如:1)用于在给定机器上测量λ值的技术; 2)创建有效的广播算法,将等待时间以节点数n作为参数; 3)创建有效的全局合并算法以用于λ不是整数的并行机。我们提出了解决这些实际问题的解决方案,并提出了在英特尔Delta机器上对新算法进行实验研究的结果。我们的主要结论是,邮政模型可以帮助进行性能预测和调整,例如,适当调整广播质量可使已知实现方式提高20%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号