Application-specific Topology-aware Mapping for Three Dimensional Topologies

机译：三维拓扑的应用特定拓扑映射

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The fastest supercomputers today such as Blue Gene/L and XT3 are connected by a 3-dimensional torus/mesh interconnect. Applications running on these machines can benefit from topology-awareness while mapping tasks to processors at runtime. By co-locating communicating tasks on nearby processors, the distance traveled by messages and hence the communication traffic can be minimized, thereby reducing communication latency and contention on the network. This paper describes preliminary work utilizing this technique and performance improvements resulting from it in the context of a n-dimensional k-point stencil program. It shows that for a fine-grained application with a high communication to computation ratio, topology-aware mapping has a significant impact on performance. Automated topology-aware mapping by the runtime using similar ideas can relieve the application writer from this burden and result in better performance. Preliminary work towards achieving this for a molecular dynamics application, NAMD, is also presented. Results on up to 32,768 processors of IBM's Blue Gene/L and 2,048 processors of Cray's XT3 support the ideas discussed in the paper.

机译：今天的最快超级计算机如蓝色基因/ L和XT3通过三维圆环互连连接。在这些机器上运行的应用程序可以从拓扑意识中受益，同时将任务映射到运行时的处理器。通过在附近的处理器上定位通信任务，可以最小化消息行进的距离，因此可以最小化通信流量，从而减少网络上的通信延迟和争用。本文介绍了利用此技术的初步工作和在N维k点模板程序的上下文中产生的技术和性能改进。它表明，对于具有高通信的细粒度应用与计算率，拓扑感知映射对性能具有显着影响。使用类似想法的运行时自动拓扑映射可以缓解此负担的应用程序作者并导致更好的性能。还提出了为分子动力学应用而达到这一点的初步努力，NAMD也是如此。结果高达32,768个IBM Blue Gene / L处理器和Cray的XT3的2,048个处理器支持纸张中讨论的想法。

著录项

来源
《International Symposium on Parallel Distributed Processing》|2008年||共8页
会议地点
作者
Abhinav Bhatele; Laxmikant V. Kale;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.138-53;
关键词

相似文献

外文文献
中文文献
专利

1. Mapping application-specific topology to mesh topology with reconfigurable switches [J] . Computers & Digital Techniques, IET . 2020,第1期

机译：使用可重新配置的交换机将特定于应用程序的拓扑映射到网状拓扑
2. Node-Fusion: Topology-aware virtual network embedding algorithm for repeatable virtual network mapping over substrate nodes [J] . Wang Desheng, Zhang Weizhe, He Hui, Concurrency and computation: practice and experience . 2021,第7期

机译：节点融合：用于基板节点的可重复虚拟网络映射的拓扑知识虚拟网络嵌入算法
3. QTMS: A quadratic time complexity topology-aware process mapping method for large-scale parallel applications on shared HPC system [J] . Yan Baicheng, Xiao Limin, Qin Guangjun, Parallel Computing . 2020,第Juna期

机译：QTMS：一种二次时间复杂性拓扑信息，用于共享HPC系统的大规模并行应用的过程映射方法
4. Application-specific Topology-aware Mapping for Three Dimensional Topologies [C] . Abhinav Bhatele, Laxmikant V. Kale International Symposium on Parallel Distributed Processing . 2008

机译：三维拓扑的应用特定拓扑映射
5. Performance analysis and acceleration of nuclear physics application on high-performance computing platforms using GPGPUs and topology-aware mapping techniques [D] . Oryspayev, Dossay. 2016

机译：使用GPGPU和拓扑信息映射技术对高性能计算平台核物理应用的性能分析与加速
6. Topology-aware illumination design for volume rendering [O] . Jianlong Zhou, Xiuying Wang, Hui Cui, 2016

机译：用于体积渲染的拓扑感知照明设计
7. Topology-Aware Mapping Techniques for Heterogeneous HPC Systems: A Systematic Survey [O] . Saad B. Alotaibi, Fathy alboraei 2018

机译：异构HPC系统的拓扑信息映射技术：系统调查

Application-specific Topology-aware Mapping for Three Dimensional Topologies

摘要

著录项

相似文献

相关主题

期刊订阅