首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA
【24h】

High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA

机译:富含GPU的超级计算机上的高生产率框架,用于运行天气预报代码ASUCA

获取原文

摘要

The weather prediction code demands large computational performance to achieve fast and high-resolution simulations. Skillful programming techniques are required for obtaining good parallel efficiency on GPU supercomputers. Our framework-based weather prediction code ASUCA has achieved good scalability with hiding complicated implementation and optimizations required for distributed GPUs, contributing to increasing the maintainability, ASUCA is a next-generation high resolution meso-scale atmospheric model being developed by the Japan Meteorological Agency. Our framework automatically translates user-written stencil functions that update grid points and generates both GPU and CPU codes. User-written codes are parallelized by MPI with intra-node GPU peer-to-peer direct access. These codes can easily utilize optimizations such as overlapping technique to hide communication overhead by computation. Our simulations on the GPU-rich supercomputer TSUBAME 2.5 at the Tokyo Institute of Technology have demonstrated good strong and weak scalability achieving 209.6 TFlops in single precision for our largest model using 4,108 NVIDIA K20X GPUs.
机译:天气预报代码需要大的计算性能来实现快速和高分辨率的模拟。在GPU超级计算机上获得良好的并行效率,需要熟练的编程技术。我们基于框架的天气预测码ASUCA已经实现了良好的可扩展性,并且覆盖了分布式GPU所需的隐藏复杂的实施和优化,有助于提高可维护性,ASUCA是日本气象学机构开发的下一代高分辨率中型大气模型。我们的框架自动转换用户写入的模板功能,可更新网格点并生成GPU和CPU代码。用户写入代码由MPI与No.Otn-No.GPU对等直接访问并行化。这些代码可以很容易地利用优化,例如重叠技术来通过计算隐藏通信开销。我们对东京工业大学的GPU丰富的超级计算机Tsubame 2.5的模拟表现出使用4,108 NVIDIA K20X GPU的最大模型的单一精度良好的强大和弱可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号