【24h】

Nek5000 with OpenACC

机译:Nek5000与OpenACC.

获取原文

摘要

Nek5000 is a computational fluid dynamics code based on the spectral element method used for the simulation of incompressible flows. We follow up on an earlier study which ported the simplified version of Nek5000 to a GPU-accelerated system by presenting the hybrid CPU/GPU implementation of the full Nek5000 code using OpenACC. The matrix-matrix multiplication, the Nek5000 gather-scatter operator and a preconditioned Conjugate Gradient solver have implemented using OpenACC for multi-GPU systems. We report an speed-up of 1.3 on single node of a Cray XK6 when using OpenACC directives in Nek5000. On 512 nodes of the Titan supercomputer, the speed-up can be approached to 1.4. A performance analysis of the Nek5000 code using Score-P and Vampir performance monitoring tools shows that overlapping of GPU kernels with host-accelerator memory transfers would considerably increase the performance of the OpenACC version of Nek5000 code.
机译:NEK5000是基于用于模拟不可压缩流的谱元方法的计算流体动力学码。我们通过使用OpenACC呈现完整NEK5000代码的混合CPU / GPU实现,将简化版本的NEK5000的简化版本的NEK5000移植到GPU加速系统的研究。矩阵矩阵乘法,NEK5000聚散散散算子和预处理的共轭梯度求解器已经为多GPU系统使用OpenACC实现。当在NEK5000中使用OPEACC指令时,我们在CRAY XK6的单个节点上报告了1.3的加速。在泰坦超级计算机的512个节点上,可以将加速器接近1.4。使用得分-P和VAMPIR性能监控工具的NEK5000代码的性能分析表明GPU内核与主机 - 加速器内存传输的重叠将大大提高NEK5000代码的OPEACC版本的性能。

著录项

相似文献

  • 外文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号