Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime

Peterson Brad; Humphrey Alan; Sunderland Dan; Sutherland James; Saad Tony; Dasari Harish; Berzins Martin

首页> 外文期刊>International journal of parallel programming >Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime

【24h】

Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime

机译：Uintah GPU异构异步多任务运行时的自动光晕管理

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Uintah computational framework is used for the parallel solution of partial differential equations on adaptive mesh refinement grids using modern supercomputers. Uintah is structured with an application layer and a separate runtime system. Uintah is based on a distributed directed acyclic graph of computational tasks, with a task scheduler that efficiently schedules and executes these tasks on both CPU cores and on-node accelerators. The runtime system identifies task dependencies, creates a task graph prior to the execution of these tasks, automatically generates MPI message tags, and automatically performs halo transfers for simulation variables. Automating halo transfers in a heterogeneous environment poses significant challenges when tasks compute within a few milliseconds, as runtime overhead affects wall time execution, or when simulation variables require large halos spanning most or all of the computational domain, as task dependencies become expensive to process. These challenges are magnified at production scale when application developers require each compute node perform thousands of different halo transfers among thousands simulation variables. The principal contribution of this work is to (1) identify and address inefficiencies that arise when mapping tasks onto the GPU in the presence of automated halo transfers, (2) implement new schemes to reduce runtime system overhead, (3) minimize application developer involvement with the runtime, and (4) show overhead reduction results from these improvements.

机译：Uintah计算框架用于使用现代超级计算机在自适应网格细化网格上并行求解偏微分方程。 Uintah由应用程序层和单独的运行时系统构成。 Uintah基于计算任务的分布式有向无环图，具有任务计划程序，可以在CPU内核和节点加速器上高效地计划和执行这些任务。运行时系统识别任务相关性，在执行这些任务之前创建任务图，自动生成MPI消息标签，并自动执行仿真变量的光环转移。当任务在几毫秒内计算时，由于运行时开销会影响墙时间的执行，或者当仿真变量需要跨越大部分或所有计算域的大光环时，由于任务相关性的处理成本很高，因此在异构环境中自动进行光环转移会带来巨大挑战。当应用程序开发人员要求每个计算节点在数千个模拟变量之间执行数千个不同的光环转移时，这些挑战在生产规模上会放大。这项工作的主要贡献在于（1）识别并解决在存在自动光晕传输的情况下将任务映射到GPU时出现的低效率；（2）实施新方案以减少运行时系统开销；（3）最小化应用程序开发人员的参与（4）显示了这些改进带来的开销减少结果。

著录项

来源
《International journal of parallel programming》 |2019年第6期|1086-1116|共31页
作者
Peterson Brad; Humphrey Alan; Sunderland Dan; Sutherland James; Saad Tony; Dasari Harish; Berzins Martin;
展开▼
作者单位

Univ Utah Sci Comp & Imaging Inst Salt Lake City UT 84112 USA;

Sandia Natl Labs POB 5800 MS 1418 Albuquerque NM 87185 USA;

Univ Utah Dept Chem Engn Salt Lake City UT 84112 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Uintah; Hybrid parallelism; Parallel; GPU; Heterogeneous systems; Stencil computation; Optimization; Concurrency; Halo transfer;

机译：英达混合并行性;平行;GPU;异构系统;模具计算;优化;并发光环转移;

相似文献

外文文献
中文文献
专利

1. An environmental modelling framework based on asynchronous many-tasks: Scalability and usability [J] . de Jong Kor, Panja Debabrata, van Kreveld Marc, Environmental Modelling & Software . 2021,第May期

机译：基于异步多任务的环境建模框架：可扩展性和可用性
2. Asynchronous runtime with distributed manager for task-based programming models [J] . Bosch Jaume, Alvarez Carlos, Jimenez-Gonzalez Daniel, Parallel Computing . 2020,第Sepa期

机译：具有基于任务编程模型的分布式管理器的异步运行时
3. Decentralized Asynchronous Crash-Resilient Runtime Verification [J] . Borzoo Bonakdarpour, Pierre Fraigniaud, Sergio Rajsbaum, LIPIcs : Leibniz International Proceedings in Informatics . 2016,第1期

机译：分散式异步崩溃弹性运行时验证
4. Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime [C] . Muthu Baskaran, Benoît Pradelle, Benoît Meister, 2016 5th Workshop on Extreme-Scale Programming Tools . 2016

机译：基于任务的异步运行时的自动代码生成和数据管理
5. Insightful Performance Analysis of Many-Task Runtimes Through Tool-Runtime Integration [D] . Chaimov, Nicholas A. 2017

机译：通过工具-运行时集成对多任务运行时进行深入的性能分析
6. Reliable Task Management Based on a Smart Contract for Runtime Verification of Sensing and Actuating Tasks in IoT Environments [O] . Lei Hang, Do-Hyeun Kim 2020

机译：基于智能合约的可靠任务管理用于物联网环境中传感和激励任务的运行时验证
7. Insightful Performance Analysis of Many-Task Runtimes through Tool-Runtime Integration [O] . Chaimov Nicholas 2017

机译：通过工具-运行时集成对多任务运行时进行深入的性能分析

Automatic Halo Management for the Uintah GPU-Heterogeneous Asynchronous Many-Task Runtime

摘要

著录项

相似文献

相关主题

期刊订阅