
Parallelizing and optimizing large-scale 3D multi-phase flow simulations on the Tianhe-2 supercomputer



Abstract

The lattice Boltzmann method (LBM) is a widely used computational fluid dynamics method for flow problems with complex geometries and various boundary conditions. Large-scale LBM simulations with increasing resolution and extended temporal range require massive high-performance computing (HPC) resources, motivating us to port the method onto modern many-core heterogeneous supercomputers such as Tianhe-2. Although many-core accelerators such as graphics processing units (GPUs) and the Intel MIC offer a dramatic advantage in floating-point performance and power efficiency over CPUs, they also pose a tough challenge for parallelizing and optimizing computational fluid dynamics codes on large-scale heterogeneous systems. In this paper, we parallelize and optimize the open-source 3D multi-phase LBM code openlbmflow on the Intel Xeon Phi (MIC) accelerated Tianhe-2 supercomputer, using a hybrid and heterogeneous MPI+OpenMP+Offload+single instruction, multiple data (SIMD) programming model. With cache blocking and a SIMD-friendly data-structure transformation, we dramatically improve SIMD and cache efficiency, raising single-thread performance on the CPU and the Phi by 7.9X and 8.8X, respectively, compared with the baseline code. To make the CPUs and Phi coprocessors collaborate efficiently, we propose a load-balance scheme that distributes workloads among the two CPUs and three Phi coprocessors within each node, and we use an asynchronous model to overlap the collaborative computation and communication as much as possible. The collaborative approach with two CPUs and three Phi coprocessors improves performance by around 3.2X compared with the CPU-only approach. Scalability tests show that openlbmflow achieves a parallel efficiency of about 60% on 2048 nodes, with about 400K cores in total. To the best of our knowledge, this is the largest-scale CPU-MIC collaborative LBM simulation of 3D multi-phase flow problems. Copyright © 2015 John Wiley & Sons, Ltd.
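The abstract does not detail the SIMD-friendly data-structure transformation, but the general technique is well known; below is a minimal sketch assuming a D3Q19-style lattice. An array-of-structures (AoS) layout interleaves the 19 populations of each cell, while a structure-of-arrays (SoA) layout keeps each velocity's populations contiguous, so the innermost loop is unit-stride and vectorizable. All identifiers (Q, NX, line_soa, relax_line, omega) are illustrative and are not taken from openlbmflow.

```c
/* Sketch of the AoS-to-SoA layout transformation for an LBM code.
 * Hypothetical names; not the authors' implementation. */

#define Q  19          /* D3Q19 lattice: 19 discrete velocities */
#define NX 256         /* cells along the innermost (x) dimension */

/* AoS: all Q populations of one cell are contiguous, so for a fixed
 * velocity q the stride between neighbouring cells is Q doubles,
 * which defeats vector loads. */
typedef struct { double f[Q]; } cell_aos;

/* SoA: for a fixed velocity q, the populations of successive cells
 * are contiguous, giving unit-stride access the compiler can vectorize. */
typedef struct { double f[Q][NX]; } line_soa;

/* BGK-style relaxation over one lattice line: f += omega * (feq - f). */
void relax_line(line_soa *restrict line, const double *restrict feq,
                double omega)
{
    for (int q = 0; q < Q; ++q) {
        #pragma omp simd   /* unit-stride inner loop; hint vectorization */
        for (int x = 0; x < NX; ++x)
            line->f[q][x] += omega * (feq[x] - line->f[q][x]);
    }
}
```

On both the Xeon and the Xeon Phi, unit-stride access is what allows the compiler to emit packed vector loads and stores; cache blocking then partitions the lattice so that each block's working set stays resident in cache between the collision and streaming steps.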
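The asynchronous CPU-Phi overlap is likewise only described at a high level. The sketch below uses the Intel Language Extensions for Offload (LEO) signal/wait clauses, which were the standard offload mechanism on Xeon Phi systems of this generation; the kernel names, the host/coprocessor buffer split, and the placeholder halo exchange are assumptions, not the authors' code.

```c
/* Sketch of asynchronous CPU+Phi collaboration: launch a non-blocking
 * offload to one Phi, compute the host's share of the domain meanwhile,
 * then wait and exchange halos. Hypothetical names throughout. */
#include <mpi.h>

void lbm_step_cpu(double *f, int ncells);                 /* host kernel */
__attribute__((target(mic))) void lbm_step_phi(double *f, int ncells);

void collaborative_step(double *f_host, int n_host,
                        double *f_phi, int n_phi, MPI_Comm comm)
{
    char sig;  /* completion tag for the asynchronous offload */

    /* Launch the Phi's share without blocking the host. */
    #pragma offload target(mic:0) signal(&sig) \
            inout(f_phi : length(n_phi))
    lbm_step_phi(f_phi, n_phi);

    /* Overlap: the CPU cores advance their own sub-domain meanwhile. */
    lbm_step_cpu(f_host, n_host);

    /* Wait for the coprocessor before touching its boundary data. */
    #pragma offload_wait target(mic:0) wait(&sig)

    /* Placeholder for the inter-node halo exchange. */
    MPI_Barrier(comm);
}
```

In a real run, MPI_Barrier would be replaced by a non-blocking MPI_Isend/MPI_Irecv halo exchange, extending the same compute-communication overlap across nodes; a full Tianhe-2 node would also drive mic:1 and mic:2 with their own signal tags.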

Bibliographic record

  • Source

    Concurrency and Computation: Practice and Experience

  • Author affiliations

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, National Laboratory for Parallel and Distributed Processing, Changsha, China;

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, National Laboratory for Parallel and Distributed Processing, Changsha, China;

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, College of Computer, Changsha, China;

    National University of Defense Technology, Changsha, China;

  • Indexing information
  • Original format: PDF
  • Language: eng
  • CLC classification
  • Keywords

    heterogeneous system; Intel Xeon Phi; Tianhe-2; multi-phase flow; LBM;

