Journal: Annals of Nuclear Energy

Hybrid MPI-communication for the multi-angular S_N parallel sweep on 3-D regular grids



Abstract

The Koch-Baker-Alcouffe (KBA) algorithm is widely used to solve the Boltzmann transport equation in parallel. To improve scaling efficiency, the sweep is launched from the four corners of a 2-D processor map. A collision occurs when a processor has multiple tasks ready to be solved at the same step. To handle these collisions, the 2-D processor map is divided into several zones, defined as sub-processor maps with different octant sweep orders. To keep communication among the colliding processors safe and efficient, four kinds of hybrid MPI-communication algorithms are analyzed. Hybrid MPI-communication with processor slice scales better than with block slice, and buffered-blocking communication with processor slice (BBP) achieves the best scaling efficiency. The weak-scaling parallel efficiency of the standard KBA algorithm and the BBP algorithm is compared for the multi-angular S_N sweep. The tests are performed on the Tianhe-2 supercomputer using 10^2 to 10^4 processors. In the test using 10^4 processors, the weak-scaling efficiency improves by over 13 percent compared with the standard KBA algorithm. The BBP and KBA algorithms are also applied to the pin-by-pin calculation of a 3-D PWR core, which indicates that the BBP algorithm has the same accuracy as the standard KBA algorithm but better efficiency. © 2018 Elsevier Ltd. All rights reserved.
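
The collision condition described in the abstract can be illustrated with a small Python sketch. It is not the paper's implementation. It assumes a deliberately simplified model: four sweeps start at the same time, one from each corner of a `px` by `py` processor map, each sweep hands a processor exactly one task, and a task reaches processor `(i, j)` after a number of steps equal to its Manhattan distance from the starting corner. The names `arrival_step` and `find_collisions`, and the `(0 or 1, 0 or 1)` corner encoding, are hypothetical and chosen only for this illustration.

```python
from collections import defaultdict

# Hypothetical corner encoding: 0 = low edge, 1 = high edge, per axis.
CORNERS = [(0, 0), (1, 0), (0, 1), (1, 1)]


def arrival_step(i, j, px, py, corner):
    """Step at which the sweep from `corner` first reaches processor (i, j).

    Simplifying assumption: one wavefront step per processor hop, so the
    arrival step is the Manhattan distance from the starting corner.
    """
    ci, cj = corner
    di = i if ci == 0 else (px - 1 - i)
    dj = j if cj == 0 else (py - 1 - j)
    return di + dj


def find_collisions(px, py):
    """Map each processor to the steps at which two or more sweeps arrive."""
    collisions = {}
    for i in range(px):
        for j in range(py):
            by_step = defaultdict(list)
            for corner in CORNERS:
                by_step[arrival_step(i, j, px, py, corner)].append(corner)
            clash = {s: cs for s, cs in by_step.items() if len(cs) > 1}
            if clash:
                collisions[(i, j)] = clash
    return collisions


if __name__ == "__main__":
    for proc, clash in sorted(find_collisions(4, 4).items()):
        print(proc, clash)
```

Under these assumptions, a 4 by 4 map shows collisions only on its two diagonals, while the center of a 3 by 3 map receives all four sweeps at the same step. A real multi-angular sweep pipelines many angle and plane blocks per octant, so the actual collision pattern, and the zone decomposition the paper uses to resolve it, are richer than this toy model.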
