OpenCL-Based FPGA Design to Accelerate the Nodal Discontinuous Galerkin Method for Unstructured Meshes

机译：基于OpenCL的FPGA设计可加速非结构化网格的节点间断Galerkin方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The exploration of FPGAs as accelerators for scientific simulations has so far mostly been focused on small kernels of methods working on regular data structures, for example in the form of stencil computations for finite difference methods. In computational sciences, often more advanced methods are employed that promise better stability, convergence, locality and scaling. Unstructured meshes are shown to be more effective and more accurate, compared to regular grids, in representing computation domains of various shapes. Using unstructured meshes, the discontinuous Galerkin method preserves the ability to perform explicit local update operations for simulations in the time domain. In this work, we investigate FPGAs as target platform for an implementation of the nodal discontinuous Galerkin method to find time-domain solutions of Maxwell's equations in an unstructured mesh. When maximizing data reuse and fitting constant coefficients into suitably partitioned on-chip memory, high computational intensity allows us to implement and feed wide data paths with hundreds of floating point operators. By decoupling off-chip memory accesses from the computations, high memory bandwidth can be sustained, even for the irregular access pattern required by parts of the application. Using the Intel/Altera OpenCL SDK for FPGAs, we present different implementation variants for different polynomial orders of the method. In different phases of the algorithm, either computational or bandwidth limits of the Arria 10 platform are almost reached, thus outperforming a highly multithreaded CPU implementation by around 2x.

机译：迄今为止，对作为科学模拟加速器的FPGA的探索主要集中在处理常规数据结构的方法的小内核上，例如以有限差分方法的模板计算形式。在计算科学中，通常采用更先进的方法，以保证更好的稳定性，收敛性，局部性和缩放性。与常规网格相比，非结构化网格在表示各种形状的计算域方面显示出了更高的效率和准确性。使用非结构化网格，不连续的Galerkin方法保留了在时域中执行显式本地更新操作的能力。在这项工作中，我们将FPGA作为目标平台，以实现节点不连续Galerkin方法的实现，以在非结构化网格中找到Maxwell方程的时域解。当最大化数据重用并将常数系数拟合到适当划分的片上存储器中时，高的计算强度使我们能够利用数百个浮点运算符来实现和馈送宽数据路径。通过将片外存储器访问与计算解耦，即使对于应用程序某些部分所需的不规则访问模式，也可以维持较高的存储器带宽。使用用于FPGA的Intel / Altera OpenCL SDK，我们为方法的不同多项式阶数提供了不同的实现变体。在算法的不同阶段，几乎可以达到Arria 10平台的计算或带宽限制，因此比高度多线程的CPU实现高出大约2倍。

著录项

来源
《IEEE Annual International Symposium on Field-Programmable Custom Computing Machines》|2018年|189-196|共8页
会议地点
作者
Tobias Kenter; Gopinath Mahale; Samer Alhaddad; Yevgen Grynko; Christian Schmitt; Ayesha Afzal; Frank Hannig; Jens Förstner; Christian Plessl;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Field programmable gate arrays; Kernel; Method of moments; Time-domain analysis; Acceleration; Computational modeling; Mathematical model;

机译：现场可编程门阵列;核;矩量法;时域分析;加速度;计算建模;数学模型;

相似文献

外文文献
中文文献
专利

1. An entropy stable nodal discontinuous Galerkin method for the two dimensional shallow water equations on unstructured curvilinear meshes with discontinuous bathymetry [J] . Wintermeyer Niklas, Winters Andrew R., Gassner Gregor J., Journal of Computational Physics . 2017,第期

机译：一种熵稳定的节点不连续的Galerkin方法，用于不连续沐浴的非结构化曲线网眼的二维浅水方程
2. A new vertex-based limiting approach for nodal discontinuous Galerkin methods on arbitrary unstructured meshes [J] . Longxiang Li, Qinghe Zhang Computers & Fluids . 2017,第期

机译：对任意非结构网格节点间断有限元方法的一个新的基于顶点限制性手段
3. Krylov implicit integration factor methods for spatial discretization on high dimensional unstructured meshes: Application to discontinuous Galerkin methods [J] . Chen S., Zhang Y.-T. Journal of Computational Physics . 2011,第11期

机译：高维非结构化网格上空间离散化的Krylov隐式积分因子方法：在不连续Galerkin方法中的应用
4. OpenCL-Based FPGA Design to Accelerate the Nodal Discontinuous Galerkin Method for Unstructured Meshes [C] . Tobias Kenter, Gopinath Mahale, Samer Alhaddad, IEEE Annual International Symposium on Field-Programmable Custom Computing Machines . 2018

机译：基于OpenCL的FPGA设计，可加速Nodal不连续Galerkin方法的非结构化网格
5. GPU-Accelerated Discontinuous Galerkin Methods on Hybrid Meshes: Applications in Seismic Imaging [D] . Wang, Zheng. 2017

机译：混合网格上GPU加速的不连续Galerkin方法：在地震成像中的应用
6. Comparison of reduced models for blood flow using Runge–Kutta discontinuous Galerkin methods [O] . Charles Puelz, Sunčica Čanić, Béatrice Rivière, -1

机译：使用Runge-Kutta不连续Galerkin方法简化的血流模型的比较
7. An Entropy Stable Nodal Discontinuous Galerkin Method for the Two Dimensional Shallow Water Equations on Unstructured Curvilinear Meshes with Discontinuous Bathymetry [O] . Niklas Wintermeyera, Andrew R. Wintersa, Gregor J. Gassnera, 2016

机译：不连续水深测量非结构曲线网格上二维浅水方程的熵稳定节点间断Galerkin方法

OpenCL-Based FPGA Design to Accelerate the Nodal Discontinuous Galerkin Method for Unstructured Meshes

摘要

著录项

相似文献

相关主题

期刊订阅