首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming

【24h】

Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming

机译：FPGA在OpenCL编程中结合计算和通信的并行处理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, Field Programmable Gate Array (FPGA) has been a topic of interest in High Performance Computing (HPC) research. Although the biggest problem in utilizing FPGAs for HPC applications is in the difficulty of developing FPGAs, this problem is being solved by High Level Synthesis (HLS). We focus on very high-performance inter-FPGA communication capabilities. The absolute floating-point performance of an FPGA is lower than that of other common accelerators such as GPUs. However, we consider that we can apply FPGAs to a wide variety of HPC applications if we can combine computations and communications on an FPGA. The purpose of this paper is to implement a parallel processing system running applications implemented by HLS combining computations and communications in FPGAs. We propose the Channel over Ethernet (CoE) system that connects multiple FPGAs directly for OpenCL parallel programming. "Channel"' is one of the new extensions provided by the Intel OpenCL environment. They are ordinally used for intra-kernel communication inside an FPGA, but we extend them to external communication through the CoE system. In this paper, we introduce two benchmarks as demonstration of the CoE system. We achieved 29.77 Gbps in throughput (approximately 75% of the theoretical peak of 40Gbps) and 950 ns in latency on our system using the pingpong benchmark, which was implemented on Intel Arria10 FPGA. In addition, we evaluated the Himeno benchmark which is a sort of 3D-Computational Fluid Dynamics (CFD) on the system, and we achieved 23689MFLOPS with 4 FPGAs on a problem of size M. We also notice strong scalability, with a 3.93 times speedup compared to a single FPGA run, on the same problem size.

机译：近年来，现场可编程门阵列（FPGA）是高性能计算（HPC）研究的兴趣主题。虽然利用FPGA用于HPC应用的最大问题是在开发FPGA的难度下，但是通过高水平合成（HLS）解决了这个问题。我们专注于非常高性能的FPGA间通信能力。 FPGA的绝对浮点性能低于其他常见促进剂，例如GPU。但是，如果我们可以将计算和通信组合在FPGA上，我们认为我们可以考虑将FPGA应用于各种HPC应用程序。本文的目的是实施运行由HLS实现的应用程序的并行处理系统，这些应用组合在FPGA中的计算和通信。我们提出了以太网（COE）系统的通道，该系统直接连接多个FPGA以进行OpenCL并行编程。 “Channel”'是Intel OpenCL环境提供的新扩展之一。它们经过顺序用于FPGA内的内核通信，但我们通过COE系统将它们扩展到外部通信。在本文中，我们介绍了两个基准作为COE系统的示范。我们在使用Pingpong基准测试的系统中实现了29.77 Gbps（约占理论峰值的理论峰值的75％）和950 ns，在Intel Arria10 FPGA上实施。此外，我们评估了系统上的一种HIMENO基准，这是一种系统的一种3D计算流体动力学（CFD），我们通过4个FPGA实现了23689MFLOGS，在尺寸M的问题上。我们还注意到了强大的可扩展性，加速了3.93倍与单个FPGA运行相比，在相同的问题大小上。

著录项

来源
《IEEE International Parallel and Distributed Processing Symposium Workshops 》|2019年|1 v.|共10页
会议地点
作者
Norihisa Fujita; Ryohei Kobayashi; Yoshiki Yamaguchi; Taisuke Boku;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133;
关键词
Field programmable gate arrays; Kernel; Ethernet; Hardware design languages; Benchmark testing; Throughput; Parallel processing;

机译：现场可编程门阵列;内核;以太网;硬件设计语言;基准测试;吞吐量;并行处理;

相似文献

外文文献
中文文献
专利

1. OPENCL PROGRAMMIERUNG FUR HETEROGENE COMPUTING-SYSTEME: PARALLEL PROCESSING SCHNELLER UND EINFACHER - Parallel programmieren ganz einfach [J] . Todd Roberts MegaLink: Elektronik + Automation . 2013 ,第4期

机译：异构计算机系统的OPENCL编程：并行处理更快，更轻松-并行编程非常容易
2. PARALLELIZATION OF SEQUENTIAL PROGRAMS: DISTRIBUTION OF ARRAYS AMONG PROCESSORS AND STRUCTURIZATION OF COMMUNICATIONS [J] . E. V. Adutskevich, N. A. Likhoded, A. O. Sikorsky Cybernetics and Systems Analysis . 2012 ,第1期

机译：顺序程序的并行化：处理程序之间的数组分布和通信的结构化
3. Lower the cost of intelligent power control with FPGAs: Combining a programmable solution with an industry-standard processor core can save time, money, and real estate [J] . WENDY LDCKHART Embedded Systems Design Europe . 2008 ,第88期

机译：使用FPGA降低智能电源控制的成本：将可编程解决方案与行业标准处理器内核结合在一起可以节省时间，金钱和空间
4. Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming [C] . Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, IEEE International Parallel and Distributed Processing Symposium Workshops . 2019

机译：OpenCL编程中FPGA组合计算与通信的并行处理
5. Program Acceleration in a Heterogeneous Computing Environment Using OpenCL, FPGA, and CPU. [D] . Hoffman, Herman Noel. 2017

机译：使用OpenCL，FPGA和CPU在异构计算环境中进行程序加速。
6. OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems [O] . John E. Stone, David Gohara, Guochun Shi -1

机译：OpenCL的：一个并行编程标准的异构计算系统
7. Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL [O] . Hamid Reza Zohouri, Artur Podobas, Satoshi Matsuoka 2018

机译：使用OpenCL在FPGA上的高性能模板计算组合的空间和时间阻塞
8. High Speed Publication Subscription Brokering Through Highly Parallel Processing on Field Programmable Gate Array (FPGA) [R] . Lin, C. 2010

机译：通过现场可编程门阵列（FpGa）的高度并行处理实现高速发布订购

Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming

摘要

著录项

相似文献

相关主题

期刊订阅