首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming
【24h】

Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming

机译:FPGA在OpenCL编程中结合计算和通信的并行处理

获取原文

摘要

In recent years, Field Programmable Gate Array (FPGA) has been a topic of interest in High Performance Computing (HPC) research. Although the biggest problem in utilizing FPGAs for HPC applications is in the difficulty of developing FPGAs, this problem is being solved by High Level Synthesis (HLS). We focus on very high-performance inter-FPGA communication capabilities. The absolute floating-point performance of an FPGA is lower than that of other common accelerators such as GPUs. However, we consider that we can apply FPGAs to a wide variety of HPC applications if we can combine computations and communications on an FPGA. The purpose of this paper is to implement a parallel processing system running applications implemented by HLS combining computations and communications in FPGAs. We propose the Channel over Ethernet (CoE) system that connects multiple FPGAs directly for OpenCL parallel programming. "Channel"' is one of the new extensions provided by the Intel OpenCL environment. They are ordinally used for intra-kernel communication inside an FPGA, but we extend them to external communication through the CoE system. In this paper, we introduce two benchmarks as demonstration of the CoE system. We achieved 29.77 Gbps in throughput (approximately 75% of the theoretical peak of 40Gbps) and 950 ns in latency on our system using the pingpong benchmark, which was implemented on Intel Arria10 FPGA. In addition, we evaluated the Himeno benchmark which is a sort of 3D-Computational Fluid Dynamics (CFD) on the system, and we achieved 23689MFLOPS with 4 FPGAs on a problem of size M. We also notice strong scalability, with a 3.93 times speedup compared to a single FPGA run, on the same problem size.
机译:近年来,现场可编程门阵列(FPGA)是高性能计算(HPC)研究的兴趣主题。虽然利用FPGA用于HPC应用的最大问题是在开发FPGA的难度下,但是通过高水平合成(HLS)解决了这个问题。我们专注于非常高性能的FPGA间通信能力。 FPGA的绝对浮点性能低于其他常见促进剂,例如GPU。但是,如果我们可以将计算和通信组合在FPGA上,我们认为我们可以考虑将FPGA应用于各种HPC应用程序。本文的目的是实施运行由HLS实现的应用程序的并行处理系统,这些应用组合在FPGA中的计算和通信。我们提出了以太网(COE)系统的通道,该系统直接连接多个FPGA以进行OpenCL并行编程。 “Channel”'是Intel OpenCL环境提供的新扩展之一。它们经过顺序用于FPGA内的内核通信,但我们通过COE系统将它们扩展到外部通信。在本文中,我们介绍了两个基准作为COE系统的示范。我们在使用Pingpong基准测试的系统中实现了29.77 Gbps(约占理论峰值的理论峰值的75%)和950 ns,在Intel Arria10 FPGA上实施。此外,我们评估了系统上的一种HIMENO基准,这是一种系统的一种3D计算流体动力学(CFD),我们通过4个FPGA实现了23689MFLOGS,在尺寸M的问题上。我们还注意到了强大的可扩展性,加速了3.93倍与单个FPGA运行相比,在相同的问题大小上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号