首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >10M-Core Scalable Fully-Implicit Solver for Nonhydrostatic Atmospheric Dynamics
【24h】

10M-Core Scalable Fully-Implicit Solver for Nonhydrostatic Atmospheric Dynamics

机译:用于非静压大气动力学的10M核可扩展全隐式求解器

获取原文

摘要

An ultra-scalable fully-implicit solver is developed for stiff time-dependent problems arising from the hyperbolic conservation laws in nonhydrostatic atmospheric dynamics. In the solver, we propose a highly efficient hybrid domain-decomposed multigrid preconditioner that can greatly accelerate the convergence rate at the extreme scale. For solving the overlapped subdomain problems, a geometry-based pipelined incomplete LU factorization method is designed to further exploit the on-chip fine-grained concurrency. We perform systematic optimizations on different hardware levels to achieve best utilization of the heterogeneous computing units and substantial reduction of data movement cost. The fully-implicit solver successfully scales to the entire system of the Sunway TaihuLight supercomputer with over 10.5M heterogeneous cores, sustaining an aggregate performance of 7.95 PFLOPS in double-precision, and enables fast and accurate atmospheric simulations at the 488-m horizontal resolution (over 770 billion unknowns) with 0.07 simulated-years-per-day. This is, to our knowledge, the largest fully-implicit simulation to date.
机译:针对非静力学大气动力学中双曲守恒律引起的时间相关的刚性问题,开发了一种超可扩展的全隐式求解器。在求解器中,我们提出了一种高效的混合域分解多网格预处理器,该预处理器可以极大地加快极端规模下的收敛速度。为了解决重叠子域问题,设计了基于几何的流水线不完全LU分解方法,以进一步利用片上细粒度并发。我们在不同的硬件级别上执行系统优化,以实现异构计算单元的最佳利用并大幅降低数据移动成本。完全隐式求解器可成功扩展到具有超过10.5M异构核的Sunway TaihuLight超级计算机的整个系统,以双精度维持7.95 PFLOPS的综合性能,并能够在488 m的水平分辨率下进行快速而准确的大气模拟(超过7700亿个未知数),每天的模拟时间为0.07。据我们所知,这是迄今为止最大的完全隐式仿真。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号