Collocating CPU-only Jobs with GPU-assisted Jobs on GPU-assisted HPC


Abstract

In recent years, GPUs have evolved rapidly and shown great potential for accelerating scientific applications, and massive GPU-assisted HPC systems have been deployed. However, as heterogeneous systems, GPU-assisted HPC systems are harder to program and utilize than conventional CPU-only systems. Statistics from the Keeneland system indicate that the effective utilization rate of computational resources is only about 40% when the system runs under normal conditions with enough jobs in its queue. Our theoretical model shows that the lack of overlap between CPU and GPU computation is a major obstacle to the efficient utilization of heterogeneous systems. In this paper, we evaluate the possibility of collocating a CPU-only job with a GPU-assisted job on the same node to increase the overlap between CPU and GPU computation and thus achieve better utilization. Several factors that can compromise performance, such as resource isolation, CPU load, and GPU memory demand, are studied using workloads from popular MPI/CUDA benchmarks. The results indicate that, when those factors are managed properly, the collocated CPU-only job can efficiently scavenge the underutilized CPU resources without affecting the performance of either collocated job. Based on this insight, an experimental system with a collocation-aware job scheduler and resource manager is proposed. With our experimental workload pool of mixed CPU and GPU jobs, the system demonstrates a 15% gain in throughput and a 10% gain in both CPU and GPU utilization.
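The abstract's argument, that a lack of CPU/GPU overlap caps utilization and that collocation can recover idle cores, can be illustrated with a toy calculation. The sketch below is not the paper's theoretical model. It assumes a hypothetical node with a fixed number of cores and one GPU, a GPU-assisted job that strictly alternates a CPU phase and a GPU phase, and a collocated CPU-only job that keeps its cores busy the whole time with perfect isolation. The function name, parameters, and numbers are all illustrative.

```python
def node_utilization(cpu_time, gpu_time, gpu_job_cores, total_cores,
                     collocated_cores=0):
    """Return (cpu_util, gpu_util) for one iteration on a single node.

    Assumptions (illustrative only, not taken from the paper):
    - the GPU-assisted job runs its CPU phase and GPU phase back to back,
      with no overlap between them;
    - its cores sit idle while the GPU kernel runs;
    - a collocated CPU-only job keeps `collocated_cores` fully busy and
      does not slow the GPU-assisted job down.
    """
    if gpu_job_cores + collocated_cores > total_cores:
        raise ValueError("jobs request more cores than the node has")

    period = cpu_time + gpu_time            # one full iteration
    gpu_util = gpu_time / period            # GPU is idle during the CPU phase

    # Core-seconds actually used during one iteration.
    busy_core_time = gpu_job_cores * cpu_time + collocated_cores * period
    cpu_util = busy_core_time / (total_cores * period)
    return cpu_util, gpu_util


if __name__ == "__main__":
    # GPU-assisted job alone: 1 of 8 cores, 2 s on CPU, then 6 s on GPU.
    print(node_utilization(2.0, 6.0, gpu_job_cores=1, total_cores=8))
    # Same job with a CPU-only job collocated on the other 7 cores.
    print(node_utilization(2.0, 6.0, gpu_job_cores=1, total_cores=8,
                           collocated_cores=7))
```

Under these assumptions, GPU utilization is unchanged by collocation while CPU utilization rises from about 3% to about 91%. Real gains depend on the interference factors the abstract lists (resource isolation, CPU load, GPU memory demand), which this sketch deliberately ignores.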


Bibliographic details

Source: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing | 2013 | pp. 418-425 | 8 pages
Conference location: Delft (NL)
Authors: Wu Jiadong; Hong Bo
Original format: PDF

