首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >User-Transparent Translation of Machine Instructions to Programmable Hardware
【24h】

User-Transparent Translation of Machine Instructions to Programmable Hardware

机译:用户透明的计算机指令翻译可编程硬件

获取原文

摘要

We describe the design and evaluation of a JIT compiler for user-transparent acceleration of loops on FPGAs. We alleviate the need for FGPA CAD tools through an overlay designed for the pipelined execution of dataflow graphs (DFGs). We target systems that tightly integrate processors and FPGAs to share system memory, exemplified by the Intel QuickAssist platform. Our JIT compiler extracts the DFGs of innermost parallel loops in code and configures the overlay to execute the iterations of the loop in a pipelined fashion, improving throughput. Our preliminary evaluation of a functioning prototype of the compiler uses a simulator of the pipelined execution of DFGs on the overlay. It shows that over 72% of the loops in the 30 PolyBench benchmarks can be accelerated. In benchmarks where all loops are accelerated, an average speedup of 2.23X over CPU execution is achieved. The average speedup across all the 30 benchmarks is 1.62X with only three experiencing a slowdown. These results encourage us to continue our work on this approach.
机译:我们描述了对FPGA上循环的用户透明加速器的JIT编译器的设计和评估。我们通过专为流水线执行数据流图(DFG)而设计的覆盖层来减轻FGPA CAD工具的需求。我们瞄准将处理器和FPGA紧密集成的系统,以共享系统内存,示出了英特尔QuickAssist平台。我们的JIT编译器在代码中提取最内部并行循环的DFG,并配置叠加以以流水线方式执行循环的迭代,提高吞吐量。我们对编译器功能原型的初步评估使用覆盖层上的流水线执行DFG的模拟器。它表明,可以加速30多个多纤板基准中的循环超过72±0.在所有循环加速的基准中,实现了通过CPU执行的平均速度为2.23倍。所有30个基准测试的平均加速度为1.62倍,只有三个经历放缓。这些结果鼓励我们继续采取这种方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号