Efficiency Analysis of the Parallel Implementation of the SIMPLE Algorithm on Multiprocessor Computers

Lashkin S. V.; Kozelkov A. S.; Yalozo A. V.; Gerasimov V. Yu.; Zelensky D. K.


Abstract

This paper describes the parallel implementation of the SIMPLE algorithm for the numerical solution of the Navier-Stokes equations on arbitrary unstructured grids. Iteration schemes for the serial and parallel versions of the SIMPLE algorithm are presented. The description of the parallel implementation pays special attention to the exchange of computational data among processors when the grid model is decomposed using fictitious cells. We discuss the storage of distributed matrices and the implementation of matrix-vector operations in parallel mode, and show that the proposed matrix storage scheme reduces the number of interprocessor exchanges. A series of numerical experiments illustrates how tuning the multigrid solver for systems of linear algebraic equations (SLAE) affects the overall efficiency of the algorithm; the tuning covers the type of cycle used (V, W, or F), the number of iterations of the smoothing operator, and the number of cells for coarsening. Two ways (direct and indirect) of evaluating the parallelization efficiency of the numerical algorithm are demonstrated. The paper presents results for several internal and external flow problems, with parallelization efficiency evaluated in both ways. It is shown that the proposed parallel implementation enables efficient computation of these problems on a thousand processors. Based on the results obtained, general recommendations are made for the optimal tuning of the multigrid solver and for selecting the optimal number of cells per processor.
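The abstract does not define its direct and indirect efficiency measures, so the following is only a minimal sketch of the textbook definitions of speedup and parallel efficiency, not the authors' method. The function names and the timing values are hypothetical and chosen purely for illustration; they are not results from the paper.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Textbook speedup: serial wall time divided by parallel wall time."""
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, n_procs: int) -> float:
    """Textbook parallel efficiency: speedup divided by processor count."""
    return speedup(t_serial, t_parallel) / n_procs


# Hypothetical wall-clock times in seconds, keyed by processor count.
# These numbers are made up for illustration only.
timings = {1: 1000.0, 10: 110.0, 100: 12.5, 1000: 2.0}

t1 = timings[1]
for n_procs, t_p in timings.items():
    s = speedup(t1, t_p)
    e = efficiency(t1, t_p, n_procs)
    print(f"p={n_procs:5d}  speedup={s:8.2f}  efficiency={e:6.2%}")
```

Under these definitions, an efficiency close to 1 means that adding processors reduces the run time almost proportionally. Efficiency usually falls as the number of cells per processor shrinks, because exchange costs begin to dominate, which is why the choice of cells per processor matters.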


Bibliographic details

Source: Journal of Applied Mechanics and Technical Physics, 2017, No. 7, pp. 1242-1259 (18 pages)
Authors: Lashkin S. V.; Kozelkov A. S.; Yalozo A. V.; Gerasimov V. Yu.; Zelensky D. K.
Author affiliation: State Atom Energy Corp Rosatom, Russian Fed Nucl Ctr, All Russia Res Inst Expt Phys, Sarov, Nizhny Novgorod, Russia
Original format: PDF
Language: English
Keywords: computational fluid dynamics; SIMPLE algorithm; multigrid solver; modeling

