Journal: Journal of Applied Mechanics and Technical Physics

Efficiency Analysis of the Parallel Implementation of the SIMPLE Algorithm on Multiprocessor Computers


Abstract

This paper describes the details of the parallel implementation of the SIMPLE algorithm for numerical solution of the Navier-Stokes system of equations on arbitrary unstructured grids. The iteration schemes for the serial and parallel versions of the SIMPLE algorithm are implemented. In the description of the parallel implementation, special attention is paid to computational data exchange among processors under the condition of the grid model decomposition using fictitious cells. We discuss the specific features for the storage of distributed matrices and implementation of vector-matrix operations in parallel mode. It is shown that the proposed way of matrix storage reduces the number of interprocessor exchanges. A series of numerical experiments illustrates the effect of tuning the multigrid solver for systems of linear algebraic equations (SLAE) on the general efficiency of the algorithm; the tuning involves the types of the cycles used (V, W, and F), the number of iterations of a smoothing operator, and the number of cells for coarsening. Two ways (direct and indirect) of efficiency evaluation for parallelization of the numerical algorithm are demonstrated. The paper presents the results of solving some internal and external flow problems with the evaluation of parallelization efficiency by two algorithms. It is shown that the proposed parallel implementation enables efficient computations for the problems on a thousand processors. Based on the results obtained, some general recommendations are made for the optimal tuning of the multigrid solver, as well as for selecting the optimal number of cells per processor.
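
The abstract does not define its direct and indirect efficiency measures, so the sketch below is only a reference point: it uses the standard textbook strong-scaling definitions of speedup and parallel efficiency, not the paper's own method. The function names and the timings in the usage lines are illustrative assumptions and are not taken from the paper.

```python
def speedup(t_ref: float, t_p: float) -> float:
    """Speedup of a run taking t_p seconds relative to a reference run taking t_ref seconds."""
    if t_ref <= 0 or t_p <= 0:
        raise ValueError("wall-clock times must be positive")
    return t_ref / t_p


def efficiency(t_ref: float, p_ref: int, t_p: float, p: int) -> float:
    """Strong-scaling efficiency of a run on p processors.

    The reference run uses p_ref processors (p_ref = 1 for a serial baseline).
    A value of 1.0 means ideal scaling; lower values reflect communication
    and load-imbalance overhead.
    """
    if p_ref <= 0 or p <= 0:
        raise ValueError("processor counts must be positive")
    return speedup(t_ref, t_p) * p_ref / p


# Hypothetical timings chosen for illustration only (not results from the paper).
ideal = efficiency(t_ref=100.0, p_ref=1, t_p=12.5, p=8)
degraded = efficiency(t_ref=100.0, p_ref=1, t_p=25.0, p=8)
print(ideal, degraded)
```

Using a multi-processor reference run (p_ref > 1) is a common choice when the problem is too large to fit on a single processor.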
