Journal: IEEE Transactions on Parallel and Distributed Systems

Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect


Abstract

High-performance multi-GPU computing has become an inevitable trend due to the ever-increasing demand for computation capability in emerging domains such as deep learning, big data and planet-scale simulations. However, the lack of a deep understanding of how modern GPUs can be connected, and of the real impact of state-of-the-art interconnect technology on multi-GPU application performance, has become a hurdle. In this paper, we fill the gap by conducting a thorough evaluation of the five latest types of modern GPU interconnects: PCIe, NVLink-V1, NVLink-V2, NVLink-SLI and NVSwitch, on six high-end servers and HPC platforms: NVIDIA P100-DGX-1, V100-DGX-1, DGX-2, OLCF's SummitDev and Summit supercomputers, as well as an SLI-linked system with two NVIDIA Turing RTX-2080 GPUs. Based on the empirical evaluation, we have observed four new types of GPU communication network NUMA effects: three are triggered by NVLink's topology, connectivity and routing, while one is caused by a PCIe chipset design issue. These observations indicate that, for an application running in a multi-GPU node, choosing the right GPU combination can have a considerable impact on GPU communication efficiency, as well as on the application's overall performance. Our evaluation can be leveraged in building practical multi-GPU performance models, which are vital for GPU task allocation, scheduling and migration in a shared environment (e.g., AI cloud and HPC centers), as well as for communication-oriented performance tuning.
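The abstract's point that the choice of GPU combination affects communication efficiency can be illustrated with a small sketch. It is not the authors' method: it assumes a pairwise peer-to-peer bandwidth matrix has already been measured for the node, and picks the group of GPUs whose slowest link is fastest. The function name `best_gpu_group` and the matrix `EXAMPLE_BW` are hypothetical, and the numbers are placeholders rather than results from the paper.

```python
from itertools import combinations


def best_gpu_group(bandwidth, k):
    """Return the k-GPU subset whose slowest pairwise link is fastest.

    bandwidth[i][j] is a measured peer-to-peer bandwidth between GPU i and
    GPU j in any consistent unit; the matrix is assumed to be symmetric.
    Ties are resolved in favor of the first subset in index order.
    """
    n = len(bandwidth)
    if not 2 <= k <= n:
        raise ValueError("k must be between 2 and the number of GPUs")

    def bottleneck(group):
        # The slowest link inside the group bounds all-to-all communication.
        return min(bandwidth[i][j] for i, j in combinations(group, 2))

    # Exhaustive search; fine for the handful of GPUs found in one node.
    return max(combinations(range(n), k), key=bottleneck)


# Hypothetical 4-GPU node. Placeholder values, not measurements.
EXAMPLE_BW = [
    [0, 50, 25, 10],
    [50, 0, 10, 25],
    [25, 10, 0, 50],
    [10, 25, 50, 0],
]

if __name__ == "__main__":
    print(best_gpu_group(EXAMPLE_BW, 2))
```

A scheduler in a shared environment could use a selection rule of this kind when assigning GPUs to a job. A realistic performance model would also need to account for routing and contention, which this sketch ignores.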
