首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >User-Transparent Translation of Machine Instructions to Programmable Hardware
【24h】

User-Transparent Translation of Machine Instructions to Programmable Hardware

机译:机器指令到可编程硬件的用户透明翻译

获取原文

摘要

We describe the design and evaluation of a JIT compiler for user-transparent acceleration of loops on FPGAs. We alleviate the need for FGPA CAD tools through an overlay designed for the pipelined execution of dataflow graphs (DFGs). We target systems that tightly integrate processors and FPGAs to share system memory, exemplified by the Intel QuickAssist platform. Our JIT compiler extracts the DFGs of innermost parallel loops in code and configures the overlay to execute the iterations of the loop in a pipelined fashion, improving throughput. Our preliminary evaluation of a functioning prototype of the compiler uses a simulator of the pipelined execution of DFGs on the overlay. It shows that over 72% of the loops in the 30 PolyBench benchmarks can be accelerated. In benchmarks where all loops are accelerated, an average speedup of 2.23X over CPU execution is achieved. The average speedup across all the 30 benchmarks is 1.62X with only three experiencing a slowdown. These results encourage us to continue our work on this approach.
机译:我们描述了JIT编译器的设计和评估,以实现用户对FPGA上的循环的透明加速。我们通过设计用于数据流图(DFG)的流水线执行的覆盖图来减轻对FGPA CAD工具的需求。我们以紧密集成处理器和FPGA以共享系统内存的系统为目标,以Intel QuickAssist平台为例。我们的JIT编译器提取代码中最里面的并行循环的DFG,并配置叠加层以流水线方式执行循环的迭代,从而提高了吞吐量。我们对编译器正常运行的原型的初步评估使用了覆盖层上DFG的流水线执行的模拟器。它显示了30个PolyBench基准测试中超过72%的循环可以加速。在所有循环都加速的基准测试中,CPU执行速度平均提高了2.23倍。所有30个基准测试的平均加速速度是1.62倍,只有三个速度有所下降。这些结果鼓励我们继续进行这种方法的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号