Published in: High performance computing (conference proceedings)

Optimizing Fusion PIC Code Performance at Scale on Cori Phase Two


Abstract

In this paper we present the results of optimizing the performance of the gyrokinetic full-f fusion particle-in-cell (PIC) code XGC1 on the Cori Phase Two Knights Landing system. As part of the NERSC Exascale Science Applications Program, the code has undergone substantial development to enable the use of vector instructions in its most expensive kernels. We study the single-node performance of the code on an absolute scale, using the roofline methodology to guide optimization efforts. Enabling vectorization and optimizing the memory layout yielded 2x speedups in single-node performance. On multiple nodes, the code scales well up to 4000 nodes, nearly half the size of the machine. We also discuss communication bottlenecks that were identified and resolved during the work.
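
The roofline methodology mentioned in the abstract bounds a kernel's attainable performance by the lower of the machine's peak compute rate and the product of the kernel's arithmetic intensity and the peak memory bandwidth. The following is a minimal sketch of that bound, not the authors' analysis: the function names and the peak figures are illustrative assumptions and are not measurements of XGC1 or of Cori.

```python
def attainable_gflops(arithmetic_intensity, peak_gflops, peak_bandwidth_gbs):
    """Roofline bound in GFLOP/s.

    arithmetic_intensity: floating-point operations per byte moved (FLOP/byte)
    peak_gflops:          peak compute rate of the node (GFLOP/s)
    peak_bandwidth_gbs:   peak memory bandwidth of the node (GB/s)
    """
    return min(peak_gflops, arithmetic_intensity * peak_bandwidth_gbs)


def ridge_point(peak_gflops, peak_bandwidth_gbs):
    """Arithmetic intensity at which the bandwidth roof meets the compute roof."""
    return peak_gflops / peak_bandwidth_gbs


# Hypothetical placeholder peaks, chosen only to make the arithmetic easy to follow.
PEAK_GFLOPS = 2000.0
PEAK_BANDWIDTH_GBS = 400.0

if __name__ == "__main__":
    ridge = ridge_point(PEAK_GFLOPS, PEAK_BANDWIDTH_GBS)
    for ai in (0.5, 5.0, 50.0):
        bound = attainable_gflops(ai, PEAK_GFLOPS, PEAK_BANDWIDTH_GBS)
        regime = "memory-bound" if ai < ridge else "compute-bound"
        print(f"AI = {ai:5.1f} FLOP/byte -> at most {bound:7.1f} GFLOP/s ({regime})")
```

In this picture, a kernel to the left of the ridge point is limited by memory traffic, so layout changes that raise its arithmetic intensity lift the bound; a kernel to the right is limited by compute, where vectorization helps it approach the compute roof.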
