Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops

Abstract

The Lattice Boltzmann Method (LBM) plays an important role in CFD applications, so accelerating LBM computation lowers simulation costs for many industries. However, loop-carried dependencies in LBM kernels prevent loop vectorization, and general-purpose compilers therefore miss many vectorization opportunities. This paper proposes a SIMD-aware loop transformation algorithm that fully exposes the vectorizable loops in LBM kernels. The algorithm first identifies as many potentially vectorizable loops as possible using a predefined dependence table. It then applies appropriate loop transformations and array copying to legalize the loop-carried dependencies, so that the compiler can automatically vectorize the identified loops. Experiments on an Intel Xeon Gold 6140 server show that the algorithm significantly raises the ratio of vectorized loops to all loops in LBM kernels. It also outperforms the Intel C++ compiler and a polyhedral optimizer, improving average lattice update speed by 147% and 120%, respectively.
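The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea behind array copying, not the authors' method. It assumes a simplified 1D streaming step in which every site receives the old value of its left neighbour. The function names `stream_in_place` and `stream_with_copy` and the `scratch` buffer are hypothetical, chosen for illustration.

```c
#include <stddef.h>
#include <string.h>

/* In-place streaming: each site takes the OLD value of its left neighbour.
 * Iterating downward keeps the result correct, but iteration i reads
 * f[i - 1], which the following iteration (i - 1) overwrites. That is a
 * loop-carried anti-dependence on f. */
void stream_in_place(double *f, size_t n)
{
    if (n < 2)
        return;
    for (size_t i = n - 1; i > 0; --i)
        f[i] = f[i - 1];
}

/* Same update after array copying: all reads come from a snapshot of the
 * old values, so no iteration reads a location that another iteration
 * writes. `scratch` must hold at least n doubles and must not overlap f,
 * which is what the restrict qualifiers promise to the compiler. */
void stream_with_copy(double *restrict f, double *restrict scratch, size_t n)
{
    if (n < 2)
        return;
    memcpy(scratch, f, n * sizeof *f);
    for (size_t i = 1; i < n; ++i)
        f[i] = scratch[i - 1];
}
```

Both functions produce the same array. The second trades extra memory traffic for a loop with independent iterations. Whether a particular compiler vectorizes either form depends on the compiler, its flags, and the target, so a vectorization report is the way to check.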

Bibliographic record

Source
International Conference on Algorithms and Architectures for Parallel Processing | 2019 | pp. 107-121 | 15 pages
Authors
Bin Qu; Song Liu; Hailong Huang; Jiajun Yuan; Qian Wang; Weiguo Wu
Original format: PDF
Keywords
Lattice Boltzmann Method; Auto vectorization; Performance; SIMD; Loop transformation algorithm

Similar literature

1. Theory of the lattice Boltzmann method: From the Boltzmann equation to the lattice Boltzmann equation [J]. Xiaoyi He, Li-Shi Luo. Physical review, E. Statistical physics, plasmas, fluids, and related interdisciplinary topics. 1997, No. 6
2. A hybrid algorithm of lattice Boltzmann method and finite difference-based lattice Boltzmann method for viscous flows [J]. Shi Xing, Huang Xianwen, Zheng Yao. International Journal for Numerical Methods in Fluids. 2017, No. 11
3. Simulation of three-dimensional homogeneous isotropic turbulence using the moment-based lattice Boltzmann method and LES-lattice Boltzmann method [J]. Muhammad IZHAM, Tomohiro FUKUI, Koji MORINISHI. Journal of Fluid Science and Technology. 2014, No. 4
4. Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops [C]. Bin Qu, Song Liu, Hailong Huang. International conference on algorithms and architectures for parallel processing. 2020
5. GPU accelerated study of heat transfer and fluid flow by lattice Boltzmann method on CUDA [D]. Ren, Qinlong. 2016
6. Simulations of time harmonic blood flow in the Mesenteric artery: comparing finite element and lattice Boltzmann methods [O]. Lilit Axner, Alfons G Hoekstra, Adam Jeays. 2009
7. A coupled Immersed Boundary-Lattice Boltzmann method for incompressible flows through moving porous media [O]. Pepona Marianna, Favier Julien. 2016
8. Theory of the Lattice Boltzmann Method: Lattice Boltzmann Models for Non-ideal Gases [R]. Luo, Li-Shi. 2001
