Many new cloud-focused applications such as deep learning and graph analytics have started to rely on the high computing throughput of GPUs, but cloud providers cannot currently support fine-grained time-sharing on GPUs to enable multi-tenancy for these types of applications. Currently, scheduling is performed by the GPU driver in combination with a hardware thread dispatcher to maximize utilization. However, when multiple applications with contrasting kernel running times and high GPU utilization need to be co-located, this approach unduly favors one or more of the applications at the expense of others. This paper presents GPUShare, a middleware solution for fair GPU sharing among high-utilization, long-running applications. It begins by analyzing the scenarios under which the current driver-based multi-process scheduling fails, noting that such scenarios are quite common. It then describes a software-based mechanism that can yield a kernel before all of its threads have run, thus giving finer control over the time slice for which the GPU is allocated to a process. By controlling time slices on the GPU through kernel yielding, GPUShare improves fair GPU sharing across tenants and outperforms the CUDA driver by up to 45% for two tenants and by up to 89% for more than two tenants, while incurring a maximum overhead of only 12%. Additional improvements are obtained from a central scheduler that further smooths out disparities across tenants' GPU shares, improving fair sharing by up to 92% for two tenants and by up to 76% for more than two tenants.