首页> 外文会议>Mathematics and Computations, Supercomputing in Nuclear Applications and Monte Carlo International Conference >CONCURRENT CPU, GPU AND MIC EXECUTION ALGORITHMS FOR ARCHER MONTE CARLO CODE INVOLVING PHOTON AND NEUTRON RADIATION TRANSPORT PROBLEMS
【24h】

CONCURRENT CPU, GPU AND MIC EXECUTION ALGORITHMS FOR ARCHER MONTE CARLO CODE INVOLVING PHOTON AND NEUTRON RADIATION TRANSPORT PROBLEMS

机译:涉及光子和中子辐射传输问题的阿彻蒙特卡洛代码的并发CPU,GPU和MIC执行算法

获取原文

摘要

ARCHER-CT and ARCHER-Neutron are Monte Carlo photon and neutron transport applications that have now been updated to utilize CPU, GPU and MIC computing devices concurrently. ARCHER detects and simultaneously utilizes all CPU, GPU and MIC processing devices that are available. A different device layout and load-balancing algorithm is implemented for each Monte Carlo transport application. ARCHER-CT utilizes a new "self service" approach that efficiently and effectively allows each device to independently grab portions of the domain and compute concurrently until the entire CT phantom domain has been simulated. ARCHER-Neutron uses a dynamic load-balancing algorithm that distributes the particles in each batch to each device based on its particles per second rate for the previous batch. This algorithm allows multiple architectures and devices to execute concurrently. A near linear scaling speedup is observed when using only GPU devices concurrently. New timing benchmarks using various combinations of various Intel and NVIDIA devices are made and presented for each application. A speedup of 16× for ARCHER-Neutron and 44× for ARCHER-CT has been observed when utilizing an entire 4U, 9 device heterogeneous computing system composed of an Intel CPU, an Intel MIC and 7 NVIDIA GPUs.
机译:ARCHER-CT和ARCHER-Neutron是蒙特卡罗光子和中子传输应用程序,现已更新,可以同时利用CPU,GPU和MIC计算设备。 ARCHER检测并同时利用所有可用的CPU,GPU和MIC处理设备。为每个蒙特卡洛运输应用程序实现了不同的设备布局和负载平衡算法。 ARCHER-CT利用新的“自助服务”方法,该方法有效且有效地允许每个设备独立捕获域的各个部分并同时进行计算,直到模拟完整个CT幻像域为止。 ARCHER-Neutron使用动态负载平衡算法,该算法基于上一批次的每秒颗粒数速率将每个批次中的颗粒分配到每个设备。该算法允许多个架构和设备同时执行。当同时使用GPU设备时,观察到接近线性的缩放速度。针对每种应用,使用各种英特尔和NVIDIA设备的各种组合制定了新的时序基准。当使用由Intel CPU,Intel MIC和7个NVIDIA GPU组成的整个4U,9设备异构计算系统时,已观察到ARCHER-Neutron的速度是16倍,ARCHER-CT的速度是44倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号