Multi-FPGA Accelerator for Scalable Stencil Computation with Constant Memory Bandwidth

Sano K.; Hatsuda Y.; Yamamoto S.

Abstract

Stencil computation is one of the important kernels in scientific computations. However, sustained performance is limited owing to restriction on memory bandwidth, especially on multicore microprocessors and graphics processing units (GPUs) because of their small operational intensity. In this paper, we present a custom computing machine (CCM), called a scalable streaming-array (SSA), for high-performance stencil computations with multiple field-programmable gate arrays (FPGAs). We design SSA based on a domain-specific programmable concept, where CCMs are programmable with the minimum functionality required for an algorithm domain. We employ a deep pipelining approach over successive iterations to achieve linear scalability for multiple devices with a constant memory bandwidth. Prototype implementation using nine FPGAs demonstrates good agreement with a performance model, and achieves 260 and 236 GFlop/s for 2D and 3D Jacobi computation, which are 87.4 and 83.9 percent of the peak, respectively, with a memory bandwidth of only 2.0 GB/s. We also evaluate the performance of SSA for state-of-the-art FPGAs.
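The pipelining idea in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the authors' hardware design: the 4-point averaging form of the Jacobi update, the function names `jacobi_step`, `cascade`, and `implied_peak`, and the per-cell count of 4 floating-point operations are all chosen here for illustration. The model chains several Jacobi iterations back to back, as cascaded pipeline stages would, and counts external memory traffic as one read and one write of the grid regardless of how many stages are chained.

```python
def jacobi_step(grid):
    """One 2D Jacobi sweep (assumed 4-point average); boundary cells stay fixed."""
    rows, cols = len(grid), len(grid[0])
    out = [row[:] for row in grid]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            out[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                + grid[i][j - 1] + grid[i][j + 1])
    return out


def cascade(grid, stages):
    """Model a chain of `stages` pipeline stages, one Jacobi iteration each.

    Returns the final grid, the total flop count, and the number of words
    moved to and from external memory. In this model the grid is read once
    before the first stage and written once after the last, so the traffic
    does not depend on `stages`.
    """
    rows, cols = len(grid), len(grid[0])
    words_moved = 2 * rows * cols  # one full read plus one full write
    flops = 0
    for _ in range(stages):
        grid = jacobi_step(grid)
        flops += 4 * (rows - 2) * (cols - 2)  # 3 adds + 1 multiply per interior cell
    return grid, flops, words_moved


def implied_peak(sustained_gflops, fraction_of_peak):
    """Peak rate implied by a sustained rate and its stated fraction of peak."""
    return sustained_gflops / fraction_of_peak


if __name__ == "__main__":
    grid = [[float(i * j) for j in range(6)] for i in range(6)]
    for stages in (1, 3, 9):
        _, flops, words = cascade(grid, stages)
        print(stages, flops, words, flops / words)
    # Figures quoted in the abstract: 260 GFlop/s at 87.4 percent of peak (2D)
    # and 236 GFlop/s at 83.9 percent of peak (3D).
    print(implied_peak(260.0, 0.874), implied_peak(236.0, 0.839))
```

In this model the flop count grows linearly with the number of stages while the words moved stay fixed, which is the sense in which deeper pipelines raise throughput without raising memory bandwidth. Applying `implied_peak` to the abstract's figures gives peaks of roughly 297 and 281 GFlop/s for the 2D and 3D cases, and 260 GFlop/s over 2.0 GB/s corresponds to 130 floating-point operations per byte of memory traffic.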

Bibliographic details

Source: IEEE Transactions on Parallel and Distributed Systems, 2014, no. 3, pp. 695-705 (11 pages)
Authors: Sano K.; Hatsuda Y.; Yamamoto S.
Affiliation: Grad. Sch. of Inf. Sci., Tohoku Univ., Sendai, Japan
Original format: PDF
Language: English
Keywords: FPGA; scalable streaming-array; custom computing machine; high-performance computation; stencil computation

