A Dynamic Self-Scheduling Scheme for Heterogeneous Multiprocessor Architectures

MEHMET E. BELVIRANLI; LAXMI N. BHUYAN; RAJIV GUPTA

首页> 外文期刊>ACM Transactions on Architecture and Code Optimization >A Dynamic Self-Scheduling Scheme for Heterogeneous Multiprocessor Architectures

【24h】

A Dynamic Self-Scheduling Scheme for Heterogeneous Multiprocessor Architectures

机译：异构多处理器体系结构的动态自调度方案

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Today's heterogeneous architectures bring together multiple general-purpose CPUs and multiple domain-specific GPUs and FPGAs to provide dramatic speedup for many applications. However, the challenge lies in utilizing these heterogeneous processors to optimize overall application performance by minimizing workload completion time. Operating system and application development for these systems is in their infancy. In this article, we propose a new scheduling and workload balancing scheme, HDSS, for execution of loops having dependent or independent iterations on heterogeneous multiprocessor systems. The new algorithm dynamically learns the computational power of each processor during an adaptive phase and then schedules the remainder of the workload using a weighted self-scheduling scheme during the completion phase. Different from previous studies, our scheme uniquely considers the runtime effects of block sizes on the performance for heterogeneous multiprocessors. It finds the right trade-off between large and small block sizes to maintain balanced workload while keeping the accelerator utilization at maximum. Our algorithm does not require offline training or architecture-specific parameters. We have evaluated our scheme on two different heterogeneous architectures: AMD 64-core Bulldozer system with nVidia Fermi C2050 GPU and Intel Xeon 32-core SGI Altix 4700 supercomputer with Xilinx Virtex 4 FPGAs. The experimental results show that our new scheduling algorithm can achieve performance improvements up to over 200% when compared to the closest existing load balancing scheme. Our algorithm also achieves full processor utilization with all processors completing at nearly the same time which is significantly better than alternative current approaches.

机译：当今的异构体系结构将多个通用CPU和多个特定于域的GPU和FPGA结合在一起，为许多应用程序提供了惊人的加速。但是，挑战在于利用这些异构处理器来通过最小化工作负载完成时间来优化整体应用程序性能。这些系统的操作系统和应用程序开发尚处于起步阶段。在本文中，我们提出了一种新的调度和工作负载平衡方案HDSS，用于在异构多处理器系统上执行具有相关或独立迭代的循环。新算法在自适应阶段动态学习每个处理器的计算能力，然后在完成阶段使用加权自调度方案调度其余工作负载。与以前的研究不同，我们的方案独特地考虑了块大小的运行时间对异构多处理器性能的影响。它在大块和小块大小之间找到了适当的权衡，以保持平衡的工作量，同时保持最大的加速器利用率。我们的算法不需要脱机训练或特定于体系结构的参数。我们已经在两种不同的异构体系结构上评估了该方案：具有nVidia Fermi C2050 GPU的AMD 64核Bulldozer系统和具有Xilinx Virtex 4 FPGA的英特尔至强32核SGI Altix 4700超级计算机。实验结果表明，与最接近的现有负载平衡方案相比，我们的新调度算法可以将性能提高200％以上。我们的算法还实现了所有处理器几乎同时完成的全部处理器利用率，这比当前的替代方法要好得多。

著录项

来源
《ACM Transactions on Architecture and Code Optimization》 |2012年第4期|共20页
作者
MEHMET E. BELVIRANLI; LAXMI N. BHUYAN; RAJIV GUPTA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Algorithms; Performance; Dynamic self-scheduling; Workload balancing; GP-GPUs; FPGAs;

机译：算法;性能;动态自调度;工作负载均衡;GP-GPU;FPGA;

相似文献

外文文献
中文文献
专利

1. A Dynamic Self-Scheduling Scheme for Heterogeneous Multiprocessor Architectures [J] . MEHMET E. BELVIRANLI, LAXMI N. BHUYAN, RAJIV GUPTA ACM Transactions on Architecture and Code Optimization . 2012,第4期

机译：异构多处理器体系结构的动态自调度方案
2. Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures [J] . Michela Becchi, Patrick Crowley Journal of instruction-level parallelism . 2008,第2008期

机译：异构多处理器体系结构上的动态线程分配
3. Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures [J] . Michela Becchi and Patrick Crowley Journal of instruction-level parallelism . 2008,第2008期

机译：异构多处理器体系结构上的动态线程分配
4. A Dynamic Partitioning Self-scheduling Scheme for Parallel Loops on Heterogeneous Clusters [C] . Chao-Tung Yang, Wen-Chung Shih, Shian-Shyong Tseng International Conference on Computational Science(ICCS 2006) pt.1; 20060528-31; Reading(GB) . 2006

机译：异构集群上并行循环的动态分区自调度方案
5. Cognitive and Brain-inspired Processing Using Parallel Algorithms and Heterogeneous Chip Multiprocessor Architecture [D] . Mendat, Daniel R. 2017

机译：使用并行算法和异构芯片多处理器架构的认知和大脑启发性处理
6. Dynamic Tables: An Architecture for Managing Evolving Heterogeneous Biomedical Data in Relational Database Management Systems [O] . John Corwin, Avi Silberschatz, Perry L. Miller, 2007

机译：动态表：一种用于在关系数据库管理系统中管理不断发展的异构生物医学数据的体系结构
7. Dynamic thread assignment on heterogeneous multiprocessor architectures [O] . Michela Becchi, Patrick Crowley 2006

机译：异构多处理器架构上的动态线程分配

A Dynamic Self-Scheduling Scheme for Heterogeneous Multiprocessor Architectures

摘要

著录项

相似文献

相关主题

期刊订阅