An analysis of the feasibility and benefits of GPU/multicore acceleration of the Weather Research and Forecasting model

Vanderbauwhede Wim; Takemi Tetsuya


Abstract

There is a growing need for ever more accurate climate and weather simulations to be delivered in shorter timescales, in particular, to guard against severe weather events such as hurricanes and heavy rainfall. Due to climate change, the severity and frequency of such events – and thus the economic impact – are set to rise dramatically. Hardware acceleration using graphics processing units (GPUs) or Field-Programmable Gate Arrays (FPGAs) could potentially result in much reduced run times or higher accuracy simulations. In this paper, we present the results of a study of the Weather Research and Forecasting (WRF) model undertaken in order to assess whether GPU and multicore acceleration of this type of numerical weather prediction (NWP) code is both feasible and worthwhile. The focus of this paper is on acceleration of code running on a single compute node through offloading of parts of the code to an accelerator such as a GPU. The governing equation set of the WRF model is based on compressible, non-hydrostatic atmospheric motion with multi-physics processes. We put this work into context by discussing its more general applicability to multi-physics fluid dynamics codes: in many fluid dynamics codes, the numerical schemes of the advection terms are based on finite differences between neighboring cells, similar to the WRF code. For fluid systems including multi-physics processes, there are many calls to these advection routines. This class of numerical codes will benefit from hardware acceleration. We studied the performance of the original code of the WRF model and proposed a simple model for comparing multicore CPU and GPU performance. Based on the results of extensive profiling of representative WRF runs, we focused on the acceleration of the scalar advection module. We discuss the implementation of this module as a data-parallel kernel in both OpenCL and OpenMP.
We show that our data-parallel kernel version of the scalar advection module runs up to seven times faster on the GPU compared with the original code on the CPU. However, as the data transfer cost between GPU and CPU is very high (as shown by our analysis), there is only a small speed-up (two times) for the fully integrated code. We show that it would be possible to offset the data transfer cost through GPU acceleration of a larger portion of the dynamics code. In order to carry out this research, we also developed an extensible software system for integrating OpenCL code into large Fortran code bases such as WRF. This is one of the main contributions of our work. We discuss the system to show how it allows the replacement of sections of the original code base with their OpenCL counterparts with minimal changes – literally only a few lines – to the original code. Our final assessment is that, even with the current system architectures, accelerating WRF – and hence also other, similar types of multi-physics fluid dynamics codes – by a factor of up to five is definitely an achievable goal. Accelerating multi-physics fluid dynamics codes including NWP codes is vital for their application to weather forecasting, environmental pollution warning, and emergency response to the dispersion of hazardous materials. Implementing hardware acceleration capability for fluid dynamics and NWP codes is a prerequisite for up-to-date and future computer architectures. Copyright © 2015 John Wiley & Sons, Ltd.
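
The relationship between the kernel-only speed-up, the transfer cost, and the integrated speed-up can be illustrated with a small back-of-the-envelope calculation. The sketch below is not the performance model proposed in the paper; it is a minimal illustration that assumes, purely for the sake of the example, that the seven-times and two-times figures refer to the same module, that times are normalised so the original CPU run of that module takes 1.0, and that the transfer volume stays fixed when more work is offloaded. All function and parameter names are hypothetical.

```python
def effective_speedup(compute_speedup: float, transfer_fraction: float) -> float:
    """Speed-up of an offloaded module once host-device transfers are included.

    The original CPU time of the module is normalised to 1.0, so the offloaded
    time is 1/compute_speedup (kernel) plus transfer_fraction (data movement).
    """
    if compute_speedup <= 0:
        raise ValueError("compute_speedup must be positive")
    if transfer_fraction < 0:
        raise ValueError("transfer_fraction must be non-negative")
    return 1.0 / (1.0 / compute_speedup + transfer_fraction)


def implied_transfer_fraction(compute_speedup: float, observed_speedup: float) -> float:
    """Transfer cost (as a fraction of the original module time) that would
    reduce a given kernel-only speed-up to the observed integrated speed-up."""
    return 1.0 / observed_speedup - 1.0 / compute_speedup


def amortised_speedup(compute_speedup: float, transfer_fraction: float,
                      work_units: float) -> float:
    """Speed-up when `work_units` times as much code is offloaded while the
    transfer cost stays fixed (an illustrative assumption, not a measurement)."""
    original_time = work_units
    offloaded_time = work_units / compute_speedup + transfer_fraction
    return original_time / offloaded_time


if __name__ == "__main__":
    # Illustrative inputs taken from the abstract: 7x kernel, 2x integrated.
    t = implied_transfer_fraction(compute_speedup=7.0, observed_speedup=2.0)
    print(f"implied transfer fraction: {t:.3f}")
    for n in (1, 2, 4, 8):
        print(n, round(amortised_speedup(7.0, t, n), 2))
```

Under these assumptions, the integrated speed-up climbs back toward the kernel-only figure as the fixed transfer cost is spread over more offloaded work, which is the qualitative point the abstract makes about accelerating a larger portion of the dynamics code.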

Bibliographic details

Source
Concurrency and Computation: Practice and Experience | 2016, Issue 7 | pp. 2052-2072 | 21 pages
Authors
Vanderbauwhede Wim; Takemi Tetsuya
Author affiliations

School of Computing Science, University of Glasgow, Glasgow, UK

Disaster Prevention Research Institute, Kyoto University, Uji, Kyoto, Japan

Original format: PDF
Language: English
Keywords
general-purpose computation on graphics processing units (GPGPU); parallelization of simulation; large-scale scientific computing

