...
首页> 外文期刊>Journal of supercomputing >A parallel multigrid solver for incompressible flows on computing architectures with accelerators
【24h】

A parallel multigrid solver for incompressible flows on computing architectures with accelerators

机译:并行多网格求解器,用于使用加速器的计算体系结构上的不可压缩流

获取原文
获取原文并翻译 | 示例

摘要

An efficient parallel multigrid pressure correction algorithm is proposed for the solution of the incompressible Navier-Stokes equations on computing architectures with acceleration devices. The pressure correction procedure is based on the numerical solution of a Poisson-type problem, which is discretized using a fourth-order finite difference compact scheme. Since this is the most time-consuming part of the solver, we propose a parallel pressure correction algorithm using an iterative method based on a block cyclic reduction solution method combined with a multigrid technique. The grid points are numbered with respect to the red-black ordering scheme for the parallel Gauss-Seidel smoother. These parallelization techniques allow the execution of the entire simulation computations on the acceleration device, minimizing memory communication costs. The realization is developed using the OpenACC API, and the numerical method is demonstrated for the solution of two classical incompressible flow test problems. The first is the two-dimensional lid-driven cavity problem over equal mesh sizes while the other is the Stokes boundary layer, which is a decent benchmark problem for unequal mesh spacing. The effect of several multigrid components on modern and legacy acceleration architectures is examined. Eventually the performance investigation demonstrates that the proposed parallel multigrid solver achieves an acceleration of more than 10 over the sequential solver and more than 4 over multi-core CPU only realizations for all tested accelerators.
机译:针对带有加速装置的计算体系中不可压缩的Navier-Stokes方程的求解,提出了一种有效的并行多网格压力校正算法。压力校正程序基于泊松型问题的数值解,该泊松型问题使用四阶有限差分紧凑方案离散化。由于这是求解器中最耗时的部分,因此我们提出了一种基于迭代的并行压力校正算法,该迭代方法是基于块循环约简求解方法并结合了多网格技术。网格点相对于平行高斯-赛德尔平滑器的红黑排序方案进行了编号。这些并行化技术允许在加速设备上执行整个仿真计算,从而最大程度地减少了存储器通信成本。该实现是使用OpenACC API开发的,并给出了用于解决两个经典不可压缩流量测试问题的数值方法。第一个是在相等网格尺寸上的二维盖驱动腔问题,另一个是斯托克斯边界层,这是网格间距不相等的一个不错的基准问题。研究了多个多网格组件对现代和传统加速架构的影响。最终,性能研究表明,对于所有经过测试的加速器,所提出的并行多网格求解器在顺序求解器上实现了10倍以上的加速,在多核CPU上实现了4倍以上的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号