首页> 外文会议>International Conference on Field Programmable Logic and Applications >NERO: A Near High-Bandwidth Memory Stencil Accelerator for Weather Prediction Modeling
【24h】

NERO: A Near High-Bandwidth Memory Stencil Accelerator for Weather Prediction Modeling

机译:NERO:用于天气预报建模的近高带宽内存模具加速器

获取原文

摘要

Ongoing climate change calls for fast and accurate weather and climate modeling. However, when solving large-scale weather prediction simulations, state-of-the-art CPU and GPU implementations suffer from limited performance and high energy consumption. These implementations are dominated by complex irregular memory access patterns and low arithmetic intensity that pose fundamental challenges to acceleration. To overcome these challenges, we propose and evaluate the use of near-memory acceleration using a reconfigurable fabric with high-bandwidth memory (HBM). We focus on compound stencils that are fundamental kernels in weather prediction models. By using high-level synthesis techniques, we develop NERO, an FPGA+HBM-based accelerator connected through IBM CAPI2 (Coherent Accelerator Processor Interface) to an IBM POWER9 host system. Our experimental results show that NERO outperforms a 16-core POWER9 system by 4.2x and 8.3x when running two different compound stencil kernels. NERO reduces the energy consumption by 22x and 29x for the same two kernels over the POWER9 system with an energy efficiency of 1.5 GFLOPS/Watt and 17.3 GFLOPS/Watt. We conclude that employing near-memory acceleration solutions for weather prediction modeling is promising as a means to achieve both high performance and high energy efficiency.
机译:持续的气候变化要求快速而准确的天气和气候模拟。但是,当解决大规模天气预报模拟时,最新的CPU和GPU实现会受到性能限制和高能耗的困扰。这些实现方式主要由复杂的不规则内存访问模式和较低的算术强度构成,这对加速提出了根本性的挑战。为了克服这些挑战,我们提出并评估了使用具有高带宽内存(HBM)的可重新配置结构的近内存加速的使用。我们专注于复合模板,这些模板是天气预报模型中的基本内核。通过使用高级综合技术,我们开发了NERO,这是一种基于FPGA + HBM的加速器,通过IBM CAPI2(相干加速器处理器接口)连接到IBM POWER9主机系统。我们的实验结果表明,运行两个不同的复合模板内核时,NERO的性能比16核POWER9系统高4.2倍和8.3倍。与POWER9系统相比,对于两个相同的内核,NERO分别将能耗降低了22倍和29倍,能耗分别为1.5 GFLOPS /瓦和17.3 GFLOPS /瓦。我们得出的结论是,采用近内存加速解决方案进行天气预报建模有望成为实现高性能和高能效的一种手段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号