Conference: International Topical Meeting on Nuclear Reactor Thermal Hydraulics

MITIGATION OF COMMUNICATION LATENCY BASED SLOWDOWNS IN THE PIDOTS MASSIVELY PARALLEL SN TRANSPORT CODE


Abstract

The PIDOTS radiation transport code utilizes a spatially decomposed Integral Transport Matrix Method (ITMM) response matrix formulation within a red/black implementation of the Parallel Gauss-Seidel (PGS) framework to solve the SN approximation of the neutron transport equation on 3D Cartesian meshes. The code is intended to fully utilize the capabilities of modern, massively parallel high-performance computing (HPC) systems. The original testing of the code verified its implementation but revealed unexpected parallel performance losses as the processor count increased and the parallelization grain size decreased. Further analysis of the code and of the InfiniBand-based communication interconnect on the Falcon HPC at Idaho National Laboratory demonstrated that tightly coupled systems of point-to-point communication could yield larger than expected slowdowns on general-use HPCs. This slowing effect was further exacerbated by the sheer number of small data-size messages that PIDOTS used in each iteration. In this work we implemented a modified communication algorithm that significantly reduces per-iteration time for fine-grained cases on all processor counts. This provides a 2x speedup for the most refined case. Additionally, evaluation of per-iteration performance is used to correlate communication cost to sub-optimal processor allocations and fabric behavior. Our new communication scheme has been evaluated across a variety of HPC systems with diverse architectures and hardware specifications. Results show that the improvements persist across all tested systems. This indicates that the modified communication scheme is likely applicable in the future to SN solvers on unstructured meshes and, more generally, to other highly communicative transport codes. Our results may also inform processor scheduling strategies for HPCs intended for massive multiprocessing.
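The abstract does not describe the modified communication algorithm itself, so the following is only an illustrative sketch of two ideas it mentions, not the PIDOTS implementation. The first part colors the subdomains of a 3D Cartesian decomposition red or black by index parity, assuming that coupling occurs only across shared faces, so that every subdomain's neighbors carry the opposite color. The second part uses a generic latency-plus-bandwidth estimate to show why many small messages can cost more than one message carrying the same total data. All function names and the latency and bandwidth values are hypothetical.

```python
from itertools import product

# Offsets to the six face-adjacent neighbors on a 3D Cartesian grid.
FACE_OFFSETS = ((1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1))


def color(i, j, k):
    """Return 'red' or 'black' from the parity of the subdomain indices."""
    return "red" if (i + j + k) % 2 == 0 else "black"


def face_neighbors(i, j, k, shape):
    """Yield the face-adjacent subdomains that lie inside the grid."""
    nx, ny, nz = shape
    for di, dj, dk in FACE_OFFSETS:
        a, b, c = i + di, j + dj, k + dk
        if 0 <= a < nx and 0 <= b < ny and 0 <= c < nz:
            yield (a, b, c)


def coloring_is_valid(shape):
    """Check that no subdomain shares a face with one of the same color."""
    return all(
        color(*sub) != color(*nbr)
        for sub in product(*(range(n) for n in shape))
        for nbr in face_neighbors(*sub, shape)
    )


def message_time(n_messages, bytes_per_message, latency_s, bandwidth_bytes_per_s):
    """Generic estimate: each message pays a fixed latency plus a transfer time.

    This is a textbook cost model used for illustration only; it is an
    assumption, not the model used in the paper.
    """
    return n_messages * (latency_s + bytes_per_message / bandwidth_bytes_per_s)


if __name__ == "__main__":
    print(coloring_is_valid((4, 3, 2)))
    # Hypothetical interconnect: 1 microsecond latency, 1 GB/s bandwidth.
    many_small = message_time(100, 1_000, 1e-6, 1e9)
    one_large = message_time(1, 100_000, 1e-6, 1e9)
    print(many_small > one_large)
```

Under this simple model the fixed per-message latency dominates when messages are small, which is consistent with the abstract's observation that the sheer number of small messages exacerbated the slowdown.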
