A Data-Oriented Method for Scheduling Dependent Tasks on High-Density Multi-GPU Systems

机译：一种面向数据的高密度多GPU系统上依赖任务的调度方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The rapidly-changing computer architectures, though improving the performance of computers, have been challenging the programming environments for efficiently harnessing the potential of novel architectures. In this area, though the high-density multi-GPU architecture enabled unparalleled performance advantage of dense GPUs in a single server, it has increased the difficulty for scheduling diversified and dependent tasks. We therefore propose a data-oriented method for scheduling dependent tasks for this architecture while providing its implementation. In our method, we model a parallel program as a collection of data-dependent tasks for which data dependencies are managed by an expressive matrix. Accordingly, we develop a hierarchical scheduler infrastructure for our model. In this, a top scheduler is built for querying the data-dependency matrix; three downstream schedulers for queuing computation tasks that are exclusively assigned to processor, accelerator or either; and a multitude of bottom schedulers each for providing a processing element with assigned tasks. We experiment our scheduler for examples of Strassen matrix multiplication and Cholesky matrix inversion algorithms on a computer that has 8 Tesla K40 GPUs. The results show that our method is capable of offering the efficient task parallelism while fulfilling the complex task dependencies. When advanced task-oriented schedulers have been widely designed for distributed systems, a lightweight data-driven scheduler could be an alternative and handy approach that can handle the dependent yet diversified tasks of data-intensive applications for the novel high-density multi-accelerator system.

机译：迅速变化的计算机体系结构尽管提高了计算机的性能，但一直在挑战编程环境以有效利用新型体系结构的潜力。在这一领域，尽管高密度的多GPU架构在单个服务器中实现了密集GPU的无与伦比的性能优势，但它增加了调度多样化和相关任务的难度。因此，我们提出了一种面向数据的方法，用于为该体系结构安排相关任务，同时提供其实现。在我们的方法中，我们将并行程序建模为数据相关任务的集合，数据相关任务由表达矩阵管理。因此，我们为模型开发了一个分层的调度程序基础结构。在此，构建了一个顶部调度程序来查询数据依赖矩阵。三个下游调度程序，用于排队专门分配给处理器，加速器或两者之一的计算任务；以及多个底部调度器，每个调度器用于向处理元件提供分配的任务。我们在装有8个Tesla K40 GPU的计算机上对Strassen矩阵乘法和Cholesky矩阵求逆算法的示例进行了实验。结果表明，我们的方法能够提供有效的任务并行性，同时满足复杂的任务依赖性。当高级面向任务的调度程序已广泛设计用于分布式系统时，轻量级数据驱动的调度程序可能是一种替代且便捷的方法，可以处理新型高密度多加速器系统中数据密集型应用程序的依赖而又多样化的任务。

著录项

来源
《2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, 2015 IEEE 12th International Conference on Embedded Software and Systems》|2015年|694-699|共6页
会议地点 New York NY(US)
作者
Peng Zhang; Yuxiang Gao; Meikang Qiu;
展开▼
作者单位

Biomed. Eng. Dept., Stony Brook Univ., Stony Brook, NY, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
graphics processing units; matrix algebra; parallel programming; scheduling; Cholesky matrix inversion algorithms; Strassen matrix multiplication; data-dependency matrix; data-oriented method; expressive matrix; hierarchical scheduler infrastructure; high-density multiGPU systems; parallel program; queuing computation tasks; task-oriented schedulers; Computer architecture; Computers; Graphics processing units; Partitioning algorithms; Processor scheduling; Runtime; Symmetric matrices; data scheduling; heterogeneous m;

机译：图形处理单元;矩阵代数;并行编程;调度; Cholesky矩阵求逆算法; Strassen矩阵乘法;数据依赖矩阵;面向数据的方法;表达矩阵;分层调度程序基础结构;高密度multiGPU系统;并行程序;排队计算任务面向任务的调度程序;计算机体系结构;计算机;图形处理单元;分区算法;处理器调度;运行时;对称矩阵;数据调度;异构;

相似文献

外文文献
中文文献
专利

1. CP methods for scheduling and routing with time-dependent task costs [J] . Elena Kelareva, Kevin Tierney, Philip Kilby EURO Journal on Computational Optimization . 2014,第3期

机译：具有时间相关任务成本的调度和路由选择CP方法
2. EA-MSCA: An effective energy-aware multi-objective modified sine-cosine algorithm for real-time task scheduling in multiprocessor systems: Methods and analysis [J] . Abdel-Basset Mohamed, Mohamed Reda, Abouhawwash Mohamed, Expert systems with applications . 2021,第Jula期

机译：EA-MSCA：多处理器系统中的实时任务调度的有效能量感知多目标修改正弦算法：方法和分析
3. Efficiency of the Methods of Scheduling Complex Sets of Tasks in Nonuniform Multiprocessor Computer Systems [J] . A. P. Barban, V. V. Ignatushchenko, I. Yu. Podshivalova Automation and Remote Control . 2003,第10期

机译：非均匀多处理器计算机系统中复杂任务调度方法的效率
4. A Data-Oriented Method for Scheduling Dependent Tasks on High-Density Multi-GPU Systems [C] . Peng Zhang, Yuxiang Gao, Meikang Qiu IEEE International Conference on High Performance Computing and Communications . 2015

机译：一种用于在高密度多GPU系统上调度相关任务的数据导向方法
5. Scheduling task with state-dependent deadlines. [D] . Shih, Chi-Sheng. 2003

机译：安排任务的状态取决于最终期限。
6. Applying Dynamic Priority Scheduling Scheme to Static Systems of Pinwheel Task Model in Power-Aware Scheduling [O] . Ye-In Seol, Young-Kuk Kim -1

机译：动态优先级调度方案在动力感知型风车任务模型静态系统中的应用
7. Dynamic Task Scheduling Methods in Heterogeneous Systems- A Survey [O] . D. I. George Amalarethinam, Jamal Mohamed College, A. Maria Josphin 2015

机译：异构系统中的动态任务调度方法 - 一项调查

A Data-Oriented Method for Scheduling Dependent Tasks on High-Density Multi-GPU Systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅