首页> 外文期刊>EPJ Web of Conferences >ITERATIVE AND PARALLEL PERFORMANCE ANALYSIS OF NON-BLOCKING COMMUNICATION ALGORITHMS IN THE MASSIVELY PARALLEL NEUTRON TRANSPORT CODE PIDOTS
【24h】

ITERATIVE AND PARALLEL PERFORMANCE ANALYSIS OF NON-BLOCKING COMMUNICATION ALGORITHMS IN THE MASSIVELY PARALLEL NEUTRON TRANSPORT CODE PIDOTS

机译:大规模并行中子传输代码PIDOTS中非阻塞通信算法的迭代和并行性能分析

获取原文
       

摘要

The PIDOTS neutral particle transport code utilizes a red/black implementation of the Parallel Gauss-Seidel algorithm to solve the SN approximation of the neutron transport equation on 3D Cartesian meshes. PIDOTS is designed for execution on massively parallel platforms and is capable of using the full resources of modern, leadership class high performance computers. Initial testing revealed that some configurations of PIDOTS’s Integral Transport Matrix Method solver demonstrated unexpectedly poor parallel scaling. Work at Idaho and Los Alamos National Laboratories then revealed that this inefficiency was a result of the accumulation of high-cost latency events in the complex blocking communication networks employed during each PIDOTS iteration. That work explored the possibility of minimizing those inefficiencies while maintaining a blocking communications model. While significant speedups were obtained, it was shown that fully mitigating the problem on general-purpose platforms was highly unlikely for a blocking code. This work continues that analysis by implementing a deeply interleaved non-blocking communication model into PIDOTS. This new model benefits from the optimization work performed on the blocking model while also providing significant opportunities to overlap the remaining un-mitigated communication costs with computation. Additionally, our new approach is easily transferable to other similarly spatially decomposed codes. The resulting algorithm was tested on LANL’s Trinity system at up to 32,768 processors and was found at that processor count to effectively hide 100% of MPI communication cost – equivalently 20% of the red/black phase time. It is expected that the implemented interleaving algorithm can fully support far higher processor counts and completely hide communication costs up ~50% of total iteration time.
机译:所述PIDOTS中性粒子输运代码利用红色/黑色执行并行高斯 - 塞德尔算法来求解三维笛卡尔网格中子输运方程的SN近似的。 PIDOTS是专为大规模并行平台执行,并能够使用现代的,领导带班高性能计算机的资源丰富。初步测试显示的PIDOTS的整体交通运输矩阵法某些配置求解证明意外差平行缩放。在爱达荷州和洛斯阿拉莫斯国家实验室工作,然后发现,这是低效率高成本的延迟事件的过程中每个PIDOTS重复使用的复杂的阻塞通信网络积累的结果。这项工作探索,同时保持阻塞通信模式最大限度地减少那些低效率的可能性。而获得显著的加速,其结果表明,完全缓解在通用平台,问题是要阻挡的代码的可能性极小。这项工作将继续通过实施深交错非阻塞通信模式进入PIDOTS这种分析。从优化工作这种新模式的好处在阻挡模型上进行,同时还提供重叠与计算剩余的未减轻的通信费用显著的机会。此外,我们的新方法是很容易转移到其他类似的空间分解代码。将所得的算法以高达32768个处理器LANL的三位一体系统上测试,并在该处理器的计数被发现有效地隐藏MPI通信成本的100% - 的红色/黑色阶段时间等效的20%。预计实施的交错算法可以完全支持高得多的处理器数量,完全隐藏了通信的费用高达总迭代时间〜50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号