User-Transparent Translation of Machine Instructions to Programmable Hardware


Abstract

We describe the design and evaluation of a JIT compiler for user-transparent acceleration of loops on FPGAs. We alleviate the need for FPGA CAD tools through an overlay designed for the pipelined execution of dataflow graphs (DFGs). We target systems that tightly integrate processors and FPGAs to share system memory, exemplified by the Intel QuickAssist platform. Our JIT compiler extracts the DFGs of innermost parallel loops in code and configures the overlay to execute the loop iterations in a pipelined fashion, improving throughput. Our preliminary evaluation of a functioning prototype of the compiler uses a simulator of the pipelined execution of DFGs on the overlay. It shows that over 72% of the loops in the 30 PolyBench benchmarks can be accelerated. In benchmarks where all loops are accelerated, an average speedup of 2.23X over CPU execution is achieved. The average speedup across all 30 benchmarks is 1.62X, with only three experiencing a slowdown. These results encourage us to continue our work on this approach.
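
The throughput benefit of pipelining loop iterations over a DFG can be illustrated with a small sketch. It is not the authors' compiler, overlay, or simulator: the example DFG, the one-cycle-per-operation latency, the initiation interval of 1, and the function names are all assumptions made for illustration, and the numbers it produces are unrelated to the reported speedups.

```python
from functools import lru_cache

# Hypothetical DFG for one iteration of c[i] = a[i] * b[i] + d[i].
# Each key is an operation; each value lists the operations it depends on.
DFG = {
    "load_a": [],
    "load_b": [],
    "load_d": [],
    "mul": ["load_a", "load_b"],
    "add": ["mul", "load_d"],
    "store_c": ["add"],
}


def pipeline_depth(dfg):
    """Length of the longest dependency chain, assuming one cycle per operation."""

    @lru_cache(maxsize=None)
    def level(node):
        # A node completes one cycle after its latest predecessor.
        return 1 + max((level(pred) for pred in dfg[node]), default=0)

    return max(level(node) for node in dfg)


def sequential_cycles(iterations, depth):
    """Cycles when each iteration finishes before the next one starts."""
    return iterations * depth


def pipelined_cycles(iterations, depth, initiation_interval=1):
    """Cycles when a new iteration enters every `initiation_interval` cycles."""
    if iterations == 0:
        return 0
    return depth + (iterations - 1) * initiation_interval


if __name__ == "__main__":
    depth = pipeline_depth(DFG)
    n = 1000
    print(depth, sequential_cycles(n, depth), pipelined_cycles(n, depth))
```

Under these assumptions the pipeline fills once and then completes one iteration per cycle, so the cycle count grows with the number of iterations rather than with iterations times DFG depth.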

Bibliographic details

Source
IEEE International Parallel and Distributed Processing Symposium Workshops | 2018 | pp. 7-14 | 8 pages
Authors
Leslie Barron; Tarek S. Abdelrahman
Original format
PDF
Keywords
Field programmable gate arrays; Acceleration; Program processors; Benchmark testing; Prototypes; Runtime; Parallel processing

Similar literature


1. Instruction Folding in a Hardware-Translation Based Java Virtual Machine [J]. Hitoshi Oi. Journal of Instruction-Level Parallelism. 2008, no. 2008

2. Deep Learning Takes on Translation: Improvements in hardware, the availability of massive amounts of data, and algorithmic upgrades are among the factors supporting better machine translation [J]. Don Monroe. Communications of the ACM. 2017, no. 6

3. User-Transparent Translation of Machine Instructions to Programmable Hardware [C]. Leslie Barron, Tarek S. Abdelrahman. IEEE International Parallel and Distributed Processing Symposium Workshops. 2018

4. The Synthesis of Programmed Instruction and Online Education: Towards a Modern-Day Teaching Machine [D]. William B. Root. 2019

5. Programmed learning in medical education. An experimental comparison of programmed instruction by teaching machine with conventional lecturing in the teaching of electrocardiography to final year medical students [O]. S. G. Owen, R. Hall, J. Anderson. 1965

6. Generation of hardware machine models from instruction set descriptions [O]. A. Fauth, M. Freericks, A. Knoll

7. Construction of a Compact Automation for the Translation of Register Transfers into Machine Instructions [R]. J. H. Jongejan, E. J. Dijkstra, D. D. de Vries. 1988
