首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >Transparent I/O-Aware GPU Virtualization for Efficient Resource Consolidation
【24h】

Transparent I/O-Aware GPU Virtualization for Efficient Resource Consolidation

机译:透明I / O感知GPU虚拟化以实现高效的资源整合

获取原文
获取外文期刊封面目录资料

摘要

Graphics processing units (GPUs) are widely used in high performance computing (HPC) and cloud computing to accelerate workloads. Virtualization provides flexible access to resources while improving utilization and throughput. This is essential to resource disaggregation, which allows ubiquitous access to remote resources among nodes. However, remote GPU virtualization at scale suffers from severe performance degradation due to inter-node communication and resource consolidation overhead, especially for data-intensive workloads.We propose HFGPU, a GPU virtualization solution transparent to application code based on application programming interface (API) remoting. We define a virtual device manager that allows remote GPUs to be seen, managed, and used as though they were local. To perform at scale we combine multi-adapter InfiniBand networking with a novel distributed I/O forwarding mechanism that eliminates consolidation bottlenecks and reduces data movement. Experiments with up to 1024 NVIDIA V100 GPUs demonstrate overhead lower than 1% for data-intensive operations.
机译:图形处理单元(GPU)广泛用于高性能计算(HPC)和云计算,以加速工作负载。虚拟化在提高利用率和吞吐量的同时提供灵活的资源访问。这对资源分列至关重要,这允许在节点之间无处不在地访问远程资源。但是,由于节点间通信和资源整合开销,尺度以级别的远程GPU虚拟化患有严重的性能下降,特别是对于数据密集型工作负载.WE提出HFGPU,基于应用程序编程接口(API)对应用程序代码透明的GPU虚拟化解决方案远程处理。我们定义了一个虚拟设备管理器,允许看到,管理和使用远程GPU,就像它们是本地的一样。要以缩放执行,我们将多适配器InfiniBand联网与新颖的分布式I / O转发机制相结合,可消除整合瓶颈并减少数据移动。高达1024个NVIDIA V100 GPU的实验表明数据密集型操作的开销低于1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号