首页> 外文会议>International Conference on High Performance Computing Simulation >Parallel 3D deterministic particle transport on Intel MIC architecture
【24h】

Parallel 3D deterministic particle transport on Intel MIC architecture

机译:Intel MIC架构上的并行3D确定性粒子传输

获取原文

摘要

Single-node computation speed is essential in large-scale parallel solutions of particle transport problems. The Intel Many Integrated Core (MIC) architecture supports more than 200 hardware threads as well as 512-bit double precision float-point vector operations. In this paper, we use the native model of MIC in the parallelization of the simulation of one energy group time-independent deterministic discrete ordinates particle transport in 3D Cartesian geometry (Sweep3D). The implementation adopts both hardware threads and vector units in MIC to efficiently exploit multi-level parallelism in the discrete ordinates method when keeping good data locality. Our optimized implementation is verified on target MIC and can provide up to 1.99 times speedup based on the original MPI code on Intel Xeon E5-2660 CPU when flux fixup is off. Compared with the prior on NVIDIA Tesla M2050 GPU, the speedup of up to 1.23 times is obtained. In addition, the difference between the implementations on MIC and GPU is discussed as well.
机译:单节点计算速度在粒子传输问题的大规模并行解决方案中是必不可少的。英特尔许多集成核心(MIC)架构支持超过200个硬件线程以及512位双重精度浮点向量操作。在本文中,我们在3D笛卡尔几何(Sweep3D)中的一个能量组时间独立的确定性离散坐标粒子传输的平行化中使用MIC的本地模型。该实现采用MIC中的硬件线程和向量单元,在保持良好的数据局部时,在离散坐标方法中有效地利用多级并行性。我们在目标麦克风上验证了我们的优化实现,当Flux修复器关闭时,基于Intel Xeon E5-2660 CPU上的原始MPI代码,可以提供高达1.99倍的加速。与先前的NVIDIA TESLA M2050 GPU相比,获得高达1.23倍的加速。此外,还讨论了MIC和GPU的实现之间的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号