首页> 外文期刊>Journal of supercomputing >Strategies for maximizing utilization on multi-CPU and multi-GPU heterogeneous architectures
【24h】

Strategies for maximizing utilization on multi-CPU and multi-GPU heterogeneous architectures

机译:在多CP​​U和多GPU异构架构上最大化利用率的策略

获取原文
获取原文并翻译 | 示例
           

摘要

This paper explores the possibility of efficiently executing a single application using multicores simultaneously with multiple GPU accelerators under a parallel task programming paradigm. In particular, we address the challenge of extending a parallel_for template to allow its exploitation on heterogeneous architectures. Due to the asymmetry of the computing resources, we propose in this work a dynamic scheduling strategy coupled with an adaptive partitioning scheme that resizes chunks to prevent underutilization and load imbalance of CPUs and GPUs. In this paper we also address the problem of the underutilization of the CPU core where a host thread operates. To solve it, we propose two different approaches: (1) a collaborative host thread strategy, in which the host thread, instead of busy-waiting for the GPU to complete, it carries out useful chunk processing; and (2) a host thread blocking strategy combined with oversubscription, that delegates on the OS the duty of scheduling threads to available CPU cores in order to guarantee that all cores are doing useful work. Using two benchmarks we evaluate the overhead introduced by our scheduling and partitioning algorithms, finding that it is negligible. We also evaluate the efficiency of the strategies proposed finding that allowing oversubscription controlled by the OS can be beneficial under certain scenarios.
机译:本文探讨了在并行任务编程范式下使用多核与多个GPU加速器同时有效执行单个应用程序的可能性。特别是,我们解决了扩展parallel_for模板以允许其在异构体系结构上利用的挑战。由于计算资源的不对称性,我们在这项工作中提出了一种动态调度策略,并结合了自适应分区方案,该方案调整了块的大小,以防止CPU和GPU的利用率不足和负载不平衡。在本文中,我们还解决了主机线程运行所在的CPU内核利用率不足的问题。为了解决这个问题,我们提出了两种不同的方法:(1)协作主机线程策略,其中主机线程执行有用的块处理,而不是忙于等待GPU完成。 (2)与超额订购相结合的主机线程阻止策略,该策略在OS上将调度线程的职责委托给可用的CPU内核,以确保所有内核都在做有用的工作。使用两个基准,我们评估了调度和分区算法引入的开销,发现它可以忽略不计。我们还评估了建议的策略的效率,发现在某些情况下允许由OS控制的超额预订可能是有益的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号