首页> 外文会议>Asian Supercomputing Conference >Acceleration of Wind Simulation Using Locally Mesh-Refined Lattice Boltzmann Method on GPU-Rich Supercomputers
【24h】

Acceleration of Wind Simulation Using Locally Mesh-Refined Lattice Boltzmann Method on GPU-Rich Supercomputers

机译:在富含GPU的超级计算机上使用局部网格精炼晶格Boltzmann方法加速风仿真

获取原文

摘要

A real-time simulation of the environmental dynamics of radioactive substances is very important from the viewpoint of nuclear security. Since airflows in large cities are turbulent with Reynolds numbers of several million, large-scale CFD simulations are needed. We developed a CFD code based on the adaptive mesh-refined Lattice Boltzmann Method (AMR-LBM). AMR method arranges fine grids in a necessary region, so that we can realize a high-resolution analysis including a global simulation area. The code is developed on the GPU-rich supercomputer TSUBAME3.0 at the Tokyo Tech, and the GPU kernel functions are tuned to achieve high performance on the Pascal GPU architecture. The code is validated against a wind tunnel experiment which was released from the National Institute of Advanced Industrial Science and Technology in Japan Thanks to the AMR method, the total number of grid points is reduced to less than 10% compared to the fine uniform grid system. The performances of weak scaling from 1 nodes to 36 nodes are examined. The GPUs (NVIDIA TESLA P100) achieved more than 10 times higher node performance than that of CPUs (Broadwell).
机译:从核安全的观点来看,放射性物质环境动态的实时模拟非常重要。由于大城市的气流是湍流的雷诺数数百万的数量,因此需要大规模的CFD模拟。我们开发了基于自适应网格精炼晶格Boltzmann方法(AMR-LBM)的CFD代码。 AMR方法在必要的区域中排列细网,以便我们可以实现包括全局模拟区域的高分辨率分析。在东京技术的GPU的超级计算机Tsubame3.0上开发了代码,并调整GPU内核功能,以在Pascal GPU架构上实现高性能。由于AMR方法,从日本国家先进的工业科学和技术研究所释放的风洞实验验证了代码,相比,与精细均匀的网格系统相比,网格点总数降至小于10% 。检查从1个节点到36个节点的弱缩放的性能。 GPU(NVIDIA TESLA P100)实现了比CPU(Broadwell)更高的节点性能的10倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号