【24h】

Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models

机译:设计以数据为中心的编程模型的节能通信运行时系统

获取原文

摘要

The insatiable demand of high performance computing is being driven by the most computationally intensive applications such as computational chemistry, climate modeling, nuclear physics, etc. The last couple of decades have observed a tremendous rise in supercomputers with architectures ranging from traditional clusters to system-on-a-chip in order to achieve the petaflop computing barrier. However, with advent of petaflop-plus computing, we have ushered in an era where power efficient system software stack is imperative for execution on exascale systems and beyond. At the same time, computationally intensive applications are exploring programming models beyond traditional message passing, as a combination of Partitioned Global Address Space (PGAS) languages and libraries, providing one-sided communication paradigm with put, get and accumulate primitives. To support the PGAS models, it is critical to design power efficient and high performance one-sided communication runtime systems. In this paper, we design and implement PASCoL, a high performance power aware one-sided communication library using Aggregate Remote Memory Copy Interface (ARMCI), the communication runtime system of Global Arrays. For various communication primitives provided by ARMCI, we study the impact of Dynamic Voltage/Frequency Scaling (DVFS) and a combination of interrupt (blocking)/polling based mechanisms provided by most modern interconnects. We implement our design and evaluate it with synthetic benchmarks using an Infini Band cluster. Our results indicate that PASCoL can achieve significant reduction in energy consumed per byte transfer without additional penalty for various one-sided communication primitives and various message sizes and data transfer patterns.
机译:诸如计算化学,气候建模,核物理等计算密集度最高的应用推动了对高性能计算的无限满足。最近几十年来,超级计算机的蓬勃发展,其架构从传统集群到系统级不等,在芯片上以达到petaflop的计算障碍。但是,随着petaflop-plus计算的出现,我们迎来了一个时代,在exascale系统及更高级别的系统上执行时,高效的系统软件栈必不可少。同时,计算密集型应用程序正在探索超越传统消息传递的编程模型,这些模型是分区全局地址空间(PGAS)语言和库的组合,提供了带有put,get和累计原语的单面通信范例。为了支持PGAS模型,设计功率高效且高性能的单侧通信运行时系统至关重要。在本文中,我们使用全局阵列的通信运行时系统聚合远程内存复制接口(ARMCI)设计并实现了PASCoL,这是一种高性能,具有功耗意识的单面通信库。对于ARMCI提供的各种通信原语,我们研究了动态电压/频率缩放(DVFS)以及大多数现代互连所提供的基于中断(阻止)/轮询机制的组合的影响。我们实施我们的设计,并使用Infini Band群集以综合基准对其进行评估。我们的结果表明,PASCoL可以显着降低每字节传输所消耗的能量,而不会对各种单面通信原语以及各种消息大小和数据传输模式造成额外的损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号