GPU acceleration of MPAS microphysics WSM6 using OpenACC directives: Performance and verification

Kim Jae Youp; Kang Ji-Sun; Joh Minsu

首页> 外文期刊>Computers & geosciences >GPU acceleration of MPAS microphysics WSM6 using OpenACC directives: Performance and verification

【24h】

GPU acceleration of MPAS microphysics WSM6 using OpenACC directives: Performance and verification

机译：使用OPENACC指令GPU加速MPAS微物理WSM6：性能和验证

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we accelerated a microphysics scheme embedded within the Model for Prediction Across Scales (MPAS), using OpenACC directives. As one of the most time-consuming physics parameterization schemes, we focused on parallelizing the Weather Research and Forecasting (WRF) single-moment 6-class microphysics scheme (WSM6) onto a graphics processing unit (GPU). We applied several essential methodologies to optimize the performance of WSM6 computation on the GPU, to minimize data transfer between the central processing unit (CPU) and GPU and to reduce the waste of GPU threads during computation. As a result, we achieved GPU runs using 1 T V100 that were 2.38 times faster than 48 message passing interface processes runs, on average. When porting the whole model onto the GPU, we achieved a x5.71 speed-up in WSM6 computation, except in I/ O communication. In addition, the precise verification method distinguished nonlinear chaotic error growth from differences introduced by GPU computation, considering the characteristics of the major output variables from WSM6. We then compared the difference between the CPU and the GPU runs to the difference between CPU runs with different compilers. Moreover, we examined bias in these differences, which can distort the climatology of model simulation. Our approach successfully passed the verification process, and this represents the successful application of GPU acceleration to realistic full-model integration of MPAS.

机译：在本研究中，我们加速了嵌入模型内的微神科方案，以跨尺度（MPA），使用OPENACC指令进行预测。作为最耗时的物理参数化方案之一，我们专注于将天气研究和预测（WRF）单机6级微型药物方案（WSM6）并行化到图形处理单元（GPU）上。我们应用了几种基本方法来优化WSM6计算对GPU的性能，以最大限度地减少中央处理单元（CPU）和GPU之间的数据传输，并在计算期间减少GPU线程的浪费。因此，我们使用1 T V100实现了GPU运行，其平均超过48消息传递接口进程的速度快2.38倍。将整个模型移植到GPU上时，我们在WSM6计算中实现了X5.71的加速，除了I / O通信。此外，考虑到WSM6的主要输出变量的特征，精确验证方法从GPU计算引入的差异区分非线性混沌误差生长。然后，我们比较CPU与GPU之间的差异运行到CPU与不同编译器之间的差异。此外，我们检查了这些差异中的偏见，这可能扭曲模型模拟的气候学。我们的方法成功通过了验证过程，这代表了GPU加速的成功应用，以实现MPA的现实全模型集成。

著录项

来源
《Computers & geosciences》 |2021年第1期|104627.1-104627.12|共12页
作者
Kim Jae Youp; Kang Ji-Sun; Joh Minsu;
展开▼
作者单位

Yonsei Univ Dept Atmospher Sci Seoul South Korea;

Korea Inst Sci & Technol Informat Natl Supercomp Ctr Daejeon South Korea;

Korea Inst Sci & Technol Informat Natl Supercomp Ctr Daejeon South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
GPU acceleration; OpenACC; MPAS; WSM6; Numerical weather/climate model;

机译：GPU加速;OPEACC;MPAS;WSM6;数值天气/气候模型;

相似文献

外文文献
中文文献
专利

1. GPU acceleration of the WSM6 cloud microphysics scheme in GRAPES model [J] . Huadong Xiao, Jing Sun, Xiaofeng Bian, Computers & geosciences . 2013,第SEPa期

机译：GRAPES模型中WSM6云微观方案的GPU加速
2. Performance of a Code Migration for the Simulation of Supersonic Ejector Flow to SMP, MIC, and GPU Using OpenMP, OpenMP+LEO, and OpenACC Directives [J] . C.Couder-Casta?eda, H.Barrios-Pi?a, I.Gitler, Scientific programming . 2015,第4期

机译：使用OpenMP，OpenMP + LEO和OpenACC指令模拟超音速喷射器流向SMP，MIC和GPU的代码迁移性能
3. Performance of a Code Migration for the Simulation of Supersonic Ejector Flow to SMP, MIC, and GPU Using OpenMP, OpenMP plus LEO, and OpenACC Directives [J] . Couder-Castaneda C., Barrios-Pina H., Gitler I., Scientific programming . 2015,第期

机译：使用OpenMP，OpenMP加LEO和OpenACC指令模拟超音速喷射器流向SMP，MIC和GPU的代码迁移性能
4. GPU Acceleration of the FINE/FR CFD Solver in a Heterogeneous Environment with OpenACC Directives [C] . X. M. Shine Zhai, David Gutzwiller, Kunal Puri, International Workshop on Accelerator Programming Using Directives . 2020

机译：GPU与OpenACC指令在异构环境中加速Fine / FR CFD求解器
5. Performance analysis and acceleration of nuclear physics application on high-performance computing platforms using GPGPUs and topology-aware mapping techniques [D] . Oryspayev, Dossay. 2016

机译：使用GPGPU和拓扑信息映射技术对高性能计算平台核物理应用的性能分析与加速
6. Accelerating prediction of chemical shift of protein structures on GPUs: Using OpenACC [O] . Eric Wright, Mauricio H. Ferrato, Alexander J. Bryer, 2020

机译：加速预测GPU上蛋白质结构的化学位移：使用OpenACC
7. Efficient implementation of OpenACC cache directive on NVIDIA GPUs [O] . Ahmad Lashgar, Amirali Baniasadi 2019

机译：高效实现NVIDIA GPU上的OpenACC缓存指令

GPU acceleration of MPAS microphysics WSM6 using OpenACC directives: Performance and verification

摘要

著录项

相似文献

相关主题

期刊订阅