Efficient Implementation of Allreduce on BlueGene/L Collective Network

Abstract

BlueGene/L currently holds the top position on the Top500 list. In its full configuration, the system will comprise 65,536 compute nodes. Application scalability is a crucial issue for a system of this size. On BlueGene/L, scalability is made possible through the efficient exploitation of special communication networks. In addition to the general-purpose MPICH2 implementation, the BlueGene/L system software provides its own optimized versions of collective communication routines. The collective network is a natural platform for reduction operations because of its built-in arithmetic units. Unfortunately, the ALUs of the collective network can handle only fixed-point operands, so exploiting that network efficiently for floating-point reductions is a challenging task. In this paper we present our experiences with implementing an efficient collective network algorithm for Allreduce sums of floating-point numbers.
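
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible way to build a floating-point sum out of integer-only reductions. It assumes two phases: an integer MAX reduction over the operands' binary exponents, followed by an integer SUM reduction over mantissas that each node has shifted to that common exponent. The function name `simulated_allreduce_sum` and the parameter `mantissa_bits` are hypothetical, and the "network" is simulated with plain Python lists rather than MPI.

```python
import math


def simulated_allreduce_sum(values, mantissa_bits=53):
    """Sum floats using only integer max and integer add (hypothetical sketch).

    `values` holds one contribution per simulated node.
    """
    # Phase 1: integer MAX reduction over the binary exponents.
    # math.frexp(v) returns (m, e) with v == m * 2**e and 0.5 <= |m| < 1.
    max_exp = max(math.frexp(v)[1] for v in values)

    # Phase 2: every node rescales its value to a fixed-point integer
    # relative to the shared maximum exponent. Smaller operands lose
    # low-order bits here, which is the precision cost of this scheme.
    scale = mantissa_bits - max_exp
    fixed = [int(round(math.ldexp(v, scale))) for v in values]

    # Integer SUM reduction: associative, so the result does not depend
    # on the order in which the network combines the operands.
    total = sum(fixed)

    # Convert the fixed-point total back to floating point.
    return math.ldexp(total, -scale)


result = simulated_allreduce_sum([1.5, 2.25, -0.75, 8.0])
```

Because the second phase adds integers, the result is the same for any ordering of the contributions, which ordinary floating-point addition does not guarantee. A real implementation would also have to handle infinities, NaNs, and accumulator overflow, all of which this sketch ignores.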


Bibliographic details

Source: European PVM/MPI (Parallel Virtual Machine and Message Passing Interface) Users' Group Meeting; September 18-21, 2005; Sorrento (IT) | 2005 | pp. 57-66 | 10 pages
Conference location: Sorrento (IT)
Authors: George Almasi; Gabor Dozsa; C. Chris Erway; Burkhardt Steinmacher-Burow
Author affiliation: IBM T. J. Watson Research Center, Yorktown Heights, NY 10598
Original format: PDF
Language of text: English
Chinese Library Classification: distributed operating systems; parallel operating systems
Date added to database: 2022-08-26 14:16:16
