2011 17th IEEE International Conference on Parallel and Distributed Systems

A Static Task Scheduling Framework for Independent Tasks Accelerated Using a Shared Graphics Processing Unit



Abstract

The High Performance Computing (HPC) field is witnessing increasing use of Graphics Processing Units (GPUs) as application accelerators, due to their massively data-parallel computing architectures and exceptional floating-point computational capabilities. The performance advantage of GPU-based acceleration is primarily realized by GPU computational kernels that operate on large amounts of data, consuming all of the available GPU resources. For applications that consist of several independent computational tasks that do not occupy the entire GPU, using the GPU sequentially, one task at a time, leads to performance inefficiencies. It is therefore important for the programmer to cluster small tasks together to share the GPU; however, the best performance cannot be achieved through ad-hoc grouping and execution of these tasks. In this paper, we explore the problem of GPU task scheduling, allowing multiple tasks to efficiently share the GPU and execute on it in parallel. We analyze the factors affecting multi-tasking parallelism and performance, and then develop a multi-tasking execution model as a performance prediction approach. The model is validated by comparison with actual execution scenarios for GPU sharing. We then present a scheduling technique and algorithm based on the proposed model, followed by experimental verification of the proposed approach on an NVIDIA Fermi GPU computing node. Our results demonstrate significant performance improvements using the proposed scheduling approach, compared with sequential execution of the tasks under the conventional multi-tasking execution scenario.
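The clustering idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's actual performance model or algorithm; it is a hypothetical greedy first-fit grouping in which each independent task is characterized by a fractional GPU-resource demand, and tasks are packed into co-execution groups whose combined demand fits within the GPU's capacity (each group would then be launched concurrently, e.g., one CUDA stream per task). All names and numbers are illustrative assumptions.

```python
# Illustrative sketch only: greedy first-fit grouping of independent GPU
# tasks by fractional resource demand (e.g., share of SMs or registers).
# The paper's scheduling model is more sophisticated; this merely shows
# why grouping small tasks beats running them sequentially.

def group_tasks(demands, capacity=1.0):
    """Pack tasks (given as fractional GPU-resource demands) into groups
    whose total demand does not exceed `capacity`. Each group represents
    one round of concurrent execution on the shared GPU."""
    groups = []
    # Consider larger tasks first (first-fit decreasing heuristic).
    for task_id, demand in sorted(enumerate(demands), key=lambda t: -t[1]):
        for group in groups:
            if group["load"] + demand <= capacity:
                group["tasks"].append(task_id)
                group["load"] += demand
                break
        else:
            groups.append({"tasks": [task_id], "load": demand})
    return groups

# Four small tasks that together exceed one GPU's capacity: two
# co-execution rounds instead of four sequential runs.
schedule = group_tasks([0.5, 0.4, 0.4, 0.3])
print([g["tasks"] for g in schedule])  # → [[0, 1], [2, 3]]
```

Under this sketch, tasks 0 and 1 share the GPU in one round and tasks 2 and 3 in another, halving the number of sequential passes relative to one-task-at-a-time execution.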
