【24h】

GPUShare: Fair-sharing Middleware for GPU Clouds

机译:GPUSHARE:GPU云共享中间件

获取原文

摘要

Many new cloud-focused applications such as deep learning and graph analytics have started to rely on the high computing throughput of GPUs, but cloud providers cannot currently support fine-grained time-sharing on GPUs to enable multi-tenancy for these types of applications. Currently, scheduling is performed by the GPU driver in combination with a hardware thread dispatcher to maximize utilization. However, when multiple applications with contrasting kernel running times and high-utilization of the GPU need to be co-located, this approach unduly favors one or more of the applications at the expense of others. This paper presents GPUShare, a middleware solution for GPU fair sharing among high-utilization, long-running applications. It begins by analyzing the scenarios under which the current driver-based multi-process scheduling fails, noting that such scenarios are quite common. It then describes a software-based mechanism that can yield a kernel before all of its threads have run, thus giving finer control over the time slice for which the GPU is allocated to a process. In controlling time slices on the GPU by yielding kernels, GPUShare improves fair GPU sharing across tenants and outperforms the CUDA driver by up to 45% for two tenants and by up to 89% for more than two tenants, while incurring a maximum overhead of only 12%. Additional improvements are obtained from having a central scheduler that further smooths out disparities across tenants' GPU shares improving fair sharing by up to 92% for two tenants and by up to 76% for more than two tenants.
机译:许多新的云专注于深度学习和图形分析的应用程序已经开始依赖GPU的高计算吞吐量,但云提供商目前不能支持GPU上的细粒度时间共享,以便为这些类型的应用程序实现多租户。目前,GPU驱动程序与硬件线程调度程序组合执行调度以最大限度地提高利用率。然而,当具有对比的内核运行时间和GPU的高利用率的多个应用程序需要共同定位时,这种方法已经过度地牺牲了一个或多个应用程序。本文介绍了GPUSHARE,一种用于GPU公平分享的中间件解决方案,在高利用率,长期运行的应用中共享。它首先分析了当前基于驱动程序的多进程调度失败的场景,指出这种方案非常常见。然后,它描述了一种基于软件的机制,可以在所有线程运行之前产生内核,从而在将GPU分配给进程的时间片上进行更精细控制。在通过屈服内核控制GPU上的时间片中,GPUSHARE改善了租户的公平GPU分享,并且对于两个租户,占地面积高达45%,占两个以上的租户,而且只有超过两个租户,才能产生最大的租户12%。获得了额外的改进,使中央调度程序进一步平滑租户GPU股的差异,将公平共享高达92%,对于两个以上的租户高达76%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号