【24h】

PARALLELIZATION OF AN ADDITIVE MULTIGRID SOLVER

机译:附加多重网格解的并行化

获取原文
获取原文并翻译 | 示例
           

摘要

This article deals with the implementation and performance analysis of a parallel algebraic multigrid solver (pAMG) for a finite-volume, unstructured computational fluid dynamics (CFD) code. The parallelization of the solver is based on the domain decomposition approach using the single program, multiple data paradigm. The Message Passing Interface library (MPI) is used for communication of data. An ILU(0) iterative solver is used for smoothing the errors arising within each partition at the different grid levels, and a multilevel synchronization across the computational domain partitions is enforced in order to improve the performance of the parallelized multigrid solver. Two synchronization strategies are evaluated. In the first the synchronization is applied across the multigrid levels during the restriction step in addition to the base level, while in the second the synchronization is enforced during the restriction and prolongation steps. The effect of gathering the coefficients across partitions for the coarsest level is also investigated. Tests on grids up to 800,000 elements are conducted for a number of diffusion and advection problems on up to 20 processors. Results show that synchronization across partitions for multigrid levels plays an essential role in ensuring good scalability. Furthermore, for a large number of partitions, gathering coefficients across partitions is important to ensure a convergence history that is consistent with the sequential solver, thus yielding the same number of iterations for parallel and sequential runs, which is crucial for retaining high scalability. The shadow-to-core elements ratio is also shown to be a good indicator for scalability.
机译:本文介绍了用于有限体积,非结构化计算流体力学(CFD)代码的并行代数多网格求解器(pAMG)的实现和性能分析。求解程序的并行化基于使用单个程序,多个数据范例的域分解方法。消息传递接口库(MPI)用于数据通信。 ILU(0)迭代求解器用于平滑在不同网格级别上每个分区内出现的错误,并且在计算域分区之间强制执行多级同步,以提高并行化多网格求解器的性能。评估了两种同步策略。在第一个中,除了基本级别外,还在限制步骤中跨多个网格级别应用了同步,而在第二个中,在限制和延长步骤中实施了同步。还研究了针对最粗糙级别跨分区收集系数的影响。在多达20万个处理器上针对多达80万个元素的网格进行了测试,以解决许多扩散和对流问题。结果表明,跨分区的多网格级别同步对于确保良好的可伸缩性起着至关重要的作用。此外,对于大量分区,跨分区收集系数对于确保与顺序求解器一致的收敛历史非常重要,因此对于并行和顺序运行而言,产生的迭代次数相同,这对于保持高可伸缩性至关重要。阴影与核心元素的比率也显示出可伸缩性的良好指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号