Journal: Future Generation Computer Systems

Performance and energy consumption of HPC workloads on a cluster based on Arm ThunderX2 CPU



Abstract

In this paper, we analyze the performance and energy consumption of an Arm-based high-performance computing (HPC) system developed within the European project Mont-Blanc 3. This system, called Dibona, has been integrated by ATOS/Bull and is powered by Marvell's latest CPU, the ThunderX2. This is the same CPU that powers the Astra supercomputer, the first Arm-based supercomputer to enter the Top500, in November 2018. Our study ranges from micro-benchmarks to large production codes. We include an interdisciplinary evaluation of three scientific applications (a finite-element fluid dynamics code, a smoothed particle hydrodynamics code, and a lattice Boltzmann code) and the Graph 500 benchmark, focusing on parallel and energy efficiency and studying their scalability up to thousands of Armv8 cores. For comparison, we run the same tests on state-of-the-art x86 nodes included in Dibona and on the Tier-0 supercomputer MareNostrum4. Our experiments show that the ThunderX2 delivers 25% lower performance on average, mainly due to its small vector unit, although this is partly compensated by its 30% wider links between the CPU and main memory. We found that the software ecosystem of the Armv8 architecture is comparable to the one available for Intel. Our results also show that the ThunderX2 delivers similar or better energy-to-solution and scalability, showing that Arm-based chips are legitimate contenders in the market of next-generation HPC systems.
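The abstract compares systems by energy-to-solution and parallel efficiency but does not state how either is computed. The sketch below uses the conventional definitions as an assumption: energy-to-solution as average power multiplied by run time, and strong-scaling parallel efficiency relative to a reference run. The function names and the sample figures are hypothetical and are not taken from the paper.

```python
def energy_to_solution(avg_power_watts: float, runtime_seconds: float) -> float:
    """Energy in joules consumed by a run: average power times elapsed time.

    Assumed conventional definition; the paper's measurement method is not
    described in the abstract.
    """
    return avg_power_watts * runtime_seconds


def parallel_efficiency(t_ref: float, n_ref: int, t_n: float, n: int) -> float:
    """Strong-scaling efficiency of a run on n cores versus a reference run.

    A value of 1.0 means the run time shrank in exact proportion to the
    number of cores added.
    """
    return (t_ref * n_ref) / (t_n * n)


# Hypothetical figures for illustration only (not results from the paper).
energy_joules = energy_to_solution(avg_power_watts=300.0, runtime_seconds=100.0)
efficiency = parallel_efficiency(t_ref=100.0, n_ref=1, t_n=30.0, n=4)
print(f"energy-to-solution: {energy_joules:.0f} J")
print(f"parallel efficiency: {efficiency:.2f}")
```

Under these definitions a slower machine can still reach a similar or lower energy-to-solution if its average power draw is proportionally lower, which is the kind of trade-off the abstract reports.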
