...
首页> 外文期刊>Computer physics communications >A fast and efficient integration of boundary conditions into a unified CUDA Kernel for a shallow water solver lattice Boltzmann Method
【24h】

A fast and efficient integration of boundary conditions into a unified CUDA Kernel for a shallow water solver lattice Boltzmann Method

机译:边界条件的快速有效地集成到浅水溶剂晶格玻璃螺栓法的统一CUDA内核中

获取原文
获取原文并翻译 | 示例
           

摘要

In this work, we present an exhaustive performance analysis of the integration of boundary conditions in a unified CUDA kernel for a lattice Boltzmann shallow water solver. This kernel is implemented under the pull scheme approach of the lattice Boltzmann method. The analysis is performed simulating open ocean domains with open and bounce-back boundary conditions. Boundary conditions treatment is divided in two steps: identification of the classes of the distribution function components in a node and branching handling. Several methods are proposed for each step, and all the combinations of them are tested with different hardware, domain size and floating point precision. Results show that high performance is achieved when using two binary precomputed values for class identification, while handling branchings with Boolean multiplication should be avoided. A full report of the MLUPS (Millions of Lattice Updates Per Second) ratio achieved with each test is presented. (C) 2019 Elsevier B.V. All rights reserved.
机译:在这项工作中,我们对统一的CUDA内核的整合进行了详尽的性能分析,用于镶嵌晶格Boltzmann浅水求解器。该内核是在格子Boltzmann方法的拉动方案方法下实现的。通过打开和反弹边界条件模拟开放海域的分析。边界条件处理分为两个步骤:识别节点和分支处理中的分发功能组件的类。为每个步骤提出了几种方法,并以不同的硬件,域大小和浮点精度测试它们的所有组合。结果表明,使用两个二进制预先计算的类识别值时,实现了高性能,同时应避免使用布尔乘法的处理分支。提出了每次测试实现的MLUP的完整报告(每秒数百万个晶格更新)比率。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号