首页> 外文会议>International conference on the physics of reactors >A MONTE CARLO NEUTRON TRANSPORT CODE FOR EIGENVALUE CALCULATIONS ON A DUAL-GPU SYSTEM AND CUDA ENVIRONMENT
【24h】

A MONTE CARLO NEUTRON TRANSPORT CODE FOR EIGENVALUE CALCULATIONS ON A DUAL-GPU SYSTEM AND CUDA ENVIRONMENT

机译:用于双GPU系统和CUDA环境的特征值计算的蒙特卡罗中子传输码

获取原文

摘要

Monte Carlo (MC) method is able to accurately calculate eigenvalues in reactor analysis. Its lengthy computation time can be reduced by general-purpose computing on Graphics Processing Units (GPU), one of the latest parallel computing techniques under development. The method of porting a regular transport code to GPU is usually very straightforward due to the "embarrassingly parallel" nature of MC code. However, the situation becomes different for eigenvalue calculation in that it will be performed on a generation-by-generation basis and the thread coordination should be explicitly taken care of. This paper presents our effort to develop such a GPU-based MC code in Compute Unified Device Architecture (CUDA) environment. The code is able to perform eigenvalue calculation under simple geometries on a multi-GPU system. The specifics of algorithm design, including thread organization and memory management were described in detail. The original CPU version of the code was tested on an Intel Xeon X5660 2.8GHz CPU, and the adapted GPU version was tested on NVIDIA Tesla M2090 GPUs. Double-precision floating point format was used throughout the calculation. The result showed that a speedup of 7.0 and 33.3 were obtained for a bare spherical core and a binary slab system respectively. The speedup factor was further increased by a factor of ~2 on a dual GPU system. The upper limit of device-level parallelism was analyzed, and a possible method to enhance the thread-level parallelism was proposed.
机译:蒙特卡罗(MC)方法能够精确地计算反应器分析中的特征值。通过在图形处理单元(GPU)上的通用计算,可以减少其冗长的计算时间,其中一个正在开发的最新并行计算技术之一。由于MC代码的“令人尴尬的平行”性质,将常规传输代码移植到GPU的方法通常非常简单。然而,对于特征值计算的情况,情况变得不同,因为它将在发电的基础上进行,并且应明确地处理线程协调。本文介绍了在计算统一设备架构(CUDA)环境中开发这种基于GPU的MC代码。该代码能够在多GPU系统上的简单几何形状下执行特征值计算。详细描述了算法设计的细节,包括线程组织和内存管理。代码的原始CPU版本在Intel Xeon X5660 2.8GHz CPU上测试了,在NVIDIA Tesla M2090 GPU上测试了适应的GPU版本。在整个计算中使用了双精度浮点格式。结果表明,分别为裸球芯和二元板系统获得7.0和33.3的加速。在双GPU系统上进一步增加了加速因子〜2的因子。分析了装置级并行性的上限,提出了一种提高螺纹平行度的可能方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号