首页> 外文会议>International Symposium on Parallel Distributed Processing >Application-specific Topology-aware Mapping for Three Dimensional Topologies
【24h】

Application-specific Topology-aware Mapping for Three Dimensional Topologies

机译:三维拓扑的应用特定拓扑映射

获取原文

摘要

The fastest supercomputers today such as Blue Gene/L and XT3 are connected by a 3-dimensional torus/mesh interconnect. Applications running on these machines can benefit from topology-awareness while mapping tasks to processors at runtime. By co-locating communicating tasks on nearby processors, the distance traveled by messages and hence the communication traffic can be minimized, thereby reducing communication latency and contention on the network. This paper describes preliminary work utilizing this technique and performance improvements resulting from it in the context of a n-dimensional k-point stencil program. It shows that for a fine-grained application with a high communication to computation ratio, topology-aware mapping has a significant impact on performance. Automated topology-aware mapping by the runtime using similar ideas can relieve the application writer from this burden and result in better performance. Preliminary work towards achieving this for a molecular dynamics application, NAMD, is also presented. Results on up to 32,768 processors of IBM's Blue Gene/L and 2,048 processors of Cray's XT3 support the ideas discussed in the paper.
机译:今天的最快超级计算机如蓝色基因/ L和XT3通过三维圆环互连连接。在这些机器上运行的应用程序可以从拓扑意识中受益,同时将任务映射到运行时的处理器。通过在附近的处理器上定位通信任务,可以最小化消息行进的距离,因此可以最小化通信流量,从而减少网络上的通信延迟和争用。本文介绍了利用此技术的初步工作和在N维k点模板程序的上下文中产生的技术和性能改进。它表明,对于具有高通信的细粒度应用与计算率,拓扑感知映射对性能具有显着影响。使用类似想法的运行时自动拓扑映射可以缓解此负担的应用程序作者并导致更好的性能。还提出了为分子动力学应用而达到这一点的初步努力,NAMD也是如此。结果高达32,768个IBM Blue Gene / L处理器和Cray的XT3的2,048个处理器支持纸张中讨论的想法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号