Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops

Abstract

The Lattice Boltzmann Method (LBM) plays an important role in computational fluid dynamics (CFD) applications, so accelerating LBM computation reduces simulation costs for many industries. However, loop-carried dependencies in LBM kernels prevent loop vectorization, and general-purpose compilers therefore miss many vectorization opportunities. This paper proposes a SIMD-aware loop transformation algorithm that fully exposes the vectorizable loops in LBM kernels. The algorithm identifies most potentially vectorizable loops according to a defined dependence table. It then applies appropriate loop transformations and array copying to legalize loop-carried dependencies, so that the compiler automatically vectorizes the identified loops. Experiments on an Intel Xeon Gold 6140 server show that the algorithm significantly raises the ratio of vectorized loops to all loops in LBM kernels. It also outperforms an Intel C++ compiler and a polyhedral optimizer, accelerating LBM computation by 147% and 120%, respectively, in average lattice update speed.
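
As a rough illustration of the array-copying idea mentioned in the abstract, the C sketch below shows a loop whose only loop-carried dependence is an anti-dependence (write-after-read) and a variant that reads from a snapshot instead. This is not the paper's algorithm or dependence table; the function names, the `scratch` buffer, and the update formula are hypothetical and chosen only for illustration. Whether a given compiler vectorizes either form depends on the compiler and its flags.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* In-place form: iteration i reads f[i + 1], which iteration i + 1 later
   overwrites. The loop therefore carries an anti-dependence on f. */
void shift_relax_inplace(double *f, const double *g, size_t n) {
    for (size_t i = 0; i + 1 < n; i++)
        f[i] = 0.5 * (f[i + 1] + g[i]);
}

/* Array-copying form: every read of the old values goes to a snapshot
   (scratch), so the loop writes f and reads only scratch and g. With the
   restrict qualifiers, no iteration depends on any other iteration. */
void shift_relax_copied(double *restrict f, const double *restrict g,
                        double *restrict scratch, size_t n) {
    assert(scratch != f); /* the snapshot must be a separate buffer */
    memcpy(scratch, f, n * sizeof *f);
    for (size_t i = 0; i + 1 < n; i++)
        f[i] = 0.5 * (scratch[i + 1] + g[i]);
}
```

Both functions produce the same values because the in-place loop never reads an element it has already overwritten. The copy costs extra memory traffic, which is why a transformation of this kind only pays off when the vectorized loop is faster by more than the cost of the copy.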

Bibliographic record

Source
International Conference on Algorithms and Architectures for Parallel Processing | 2020 | xxii, 715 p. | 15 pages
Authors
Bin Qu; Song Liu; Hailong Huang; Jiajun Yuan; Qian Wang; Weiguo Wu
Original format: PDF
Chinese Library Classification: Computer applications
Keywords
Lattice Boltzmann Method; Auto vectorization; Performance; SIMD; Loop transformation algorithm

Similar literature

1. Theory of the lattice Boltzmann method: From the Boltzmann equation to the lattice Boltzmann equation [J]. Xiaoyi He, Li-Shi Luo. Physical Review E: Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics. 1997, No. 6
2. A hybrid algorithm of lattice Boltzmann method and finite difference-based lattice Boltzmann method for viscous flows [J]. Shi Xing, Huang Xianwen, Zheng Yao. International Journal for Numerical Methods in Fluids. 2017, No. 11
3. Simulation of three-dimensional homogeneous isotropic turbulence using the moment-based lattice Boltzmann method and LES-lattice Boltzmann method [J]. Muhammad Izham, Tomohiro Fukui, Koji Morinishi. Journal of Fluid Science and Technology. 2014, No. 4
4. Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops [C]. Bin Qu, Song Liu, Hailong Huang. International Conference on Algorithms and Architectures for Parallel Processing. 2020
5. GPU accelerated study of heat transfer and fluid flow by lattice Boltzmann method on CUDA [D]. Ren, Qinlong. 2016
6. Simulations of time harmonic blood flow in the Mesenteric artery: comparing finite element and lattice Boltzmann methods [O]. Lilit Axner, Alfons G Hoekstra, Adam Jeays. 2009
7. A coupled Immersed Boundary-Lattice Boltzmann method for incompressible flows through moving porous media [O]. Pepona Marianna, Favier Julien. 2016
8. Theory of the Lattice Boltzmann Method: Lattice Boltzmann Models for Non-ideal Gases [R]. Luo, Li-Shi. 2001
