...
首页> 外文期刊>International Journal of High Performance Computing Applications >Data decomposition in Monte Carlo neutron transport simulations using global view arrays
【24h】

Data decomposition in Monte Carlo neutron transport simulations using global view arrays

机译:使用全局视图阵列的蒙特卡洛中子输运模拟中的数据分解

获取原文
获取原文并翻译 | 示例
           

摘要

Accommodating large tally data can be a challenging problem for Monte Carlo neutron transport simulations. Current approaches include either simple data replication, or are based on application-controlled decomposition such as domain partitioning or client/server models, which are limited by either memory cost or performance loss. We propose and analyze an alternative solution based on global view arrays. By using global view arrays, tallies are naturally partitioned into small globally addressable blocks that fit in the limited on-node memory of compute nodes, achieving both highly scalable memory and performance efficiency. This approach also greatly simplifies the programmability compared with application-controlled approaches. Our implementation is based on integrating a global view library built on MPI one-sided communication, global view resilience (GVR), into the OpenMC Monte Carlo transport code. The remote memory access (RMA)-based global view array implementation is able to achieve 85% efficiency at 16,384 processes compared with 1,000 processes with 2.39TB mesh tally across 1,366 nodes on a Cray XC30 supercomputer. Our results improve scalability significantly compared with the tally server approach and are better than any other published results, indicating that global view array is a promising alternative to enable full-core light water reactor analysis on current and future computer systems.
机译:对于蒙特卡洛中子输运模拟来说,容纳大计数数据可能是一个具有挑战性的问题。当前的方法包括简单的数据复制,或基于应用程序控制的分解(例如域分区或客户端/服务器模型),这些分解受内存成本或性能损失的限制。我们提出并分析基于全局视图数组的替代解决方案。通过使用全局视图阵列,计数可以自然地划分为小的全局可寻址块,这些块可放入计算节点的有限节点内存中,从而实现高度可扩展的内存和性能效率。与应用程序控制的方法相比,此方法还大大简化了可编程性。我们的实现基于将基于MPI单面通信,全局视图复原力(GVR)构建的全局视图库集成到OpenMC蒙特卡洛传输代码中。基于远程内存访问(RMA)的全局视图阵列实现能够在16,384个进程上实现85%的效率,而在Cray XC30超级计算机上,在1,366个节点上具有2.39TB网格标记的1,000个进程中,该效率达到了85%。与tally服务器方法相比,我们的结果显着提高了可伸缩性,并且比任何其他已发表的结果都要好,这表明全局视图阵列是在当前和将来的计算机系统上启用全核轻水反应堆分析的有希望的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号