...
首页> 外文期刊>Journal of supercomputing >GPU parallelization of multigrid RANS solver for three-dimensional aerodynamic simulations on multiblock grids
【24h】

GPU parallelization of multigrid RANS solver for three-dimensional aerodynamic simulations on multiblock grids

机译:多嵌段网格上三维空气动力模拟的Multigrid Rans求解器的GPU并行化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, graphical processing units (GPUs) are leveraged to accelerate Bombardier's full aircraft Navier-Stokes solver, a finite-volume, cell-centered RANS flow solver for multiblock structured grids. The efficiency of different parallel smoothers on GPUs is studied, in the context of solving the RANS equations with a nonlinear full approximation storage multigrid scheme. Many variants of parallel red-black Gauss-Seidel and Jacobi solvers are investigated and their efficiency compared against sequential algorithms such as the lower-upper symmetric Gauss-Seidel solver on both CPUs and GPUs. Parametric studies on three-dimensional aircraft configurations are performed to identify the optimal smoothers and determine the optimal number of smoothing iterations on each multigrid level. Furthermore, the efficiency of different approaches to overlapping the communication and computation for the MPI-CUDA implementation in the multi-GPU code is investigated. The results show that the best runtime with the GPU code is obtained using a weaker smoother with more sweeps per multigrid level, whereas the best runtime with the CPU code is obtained using a stronger smoother with fewer sweeps per multigrid level. Despite using a weaker smoother and therefore more iterations to converge to the final solutions, the GPU-accelerated code is significantly faster than the CPU code.
机译:在本文中,图形处理单元(GPU)被利用以加速庞巴迪的全架Navier-Stokes求解器,有限体积,以用于多块结构网格的rans rans流量求解器。在用非线性全近似存储多基体方案求解RAN方程的背景下,研究了GPU上不同平行的SmooThers的效率。研究了并联红黑色高斯Seidel和Jacobi溶剂的许多变体,并与顺序算法(如CPU和GPU上的下上部对称高斯-Seidel求解器)进行了比较的效率。执行关于三维飞机配置的参数研究,以识别最佳的流量,并确定每个多重资源级别的平滑迭代的最佳数量。此外,研究了在多GPU代码中对MPI-CUDA实现重叠的不同方法的效率和计算。结果表明,使用GPU代码的最佳运行时间使用较弱的更漂亮,每个多个多区级别扫描更多扫描,而使用较强的更漂亮的扫描液的最佳运行时间使用较少的扫描量较少。尽管使用较弱的更弱,因此更频繁迭代到最终解决方案,但GPU加速的代码明显快于CPU代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号