首页> 外文期刊>International journal of reconfigurable computing >Transparent Runtime Migration of Loop-Based Traces of Processor Instructions to Reconfigurable Processing Units
【24h】

Transparent Runtime Migration of Loop-Based Traces of Processor Instructions to Reconfigurable Processing Units

机译:将基于处理器指令的基于循环的跟踪透明运行时迁移到可重新配置的处理单元

获取原文
           

摘要

The ability to map instructions running in a microprocessor to a reconfigurable processing unit (RPU), acting as a coprocessor, enables the runtime acceleration of applications and ensures code and possibly performance portability. In this work, we focus on the mapping of loop-based instruction traces (called Megablocks) to RPUs. The proposed approach considers offline partitioning and mapping stages without ignoring their future runtime applicability. We present a toolchain that automatically extracts specific trace-based loops, called Megablocks, from MicroBlaze instruction traces and generates an RPU for executing those loops. Our hardware infrastructure is able to move loop execution from the microprocessor to the RPU transparently, at runtime, and without changing the executable binaries. The toolchain and the system are fully operational. Three FPGA implementations of the system, differing in the hardware interfaces used, were tested and evaluated with a set of 15 application kernels. Speedups ranging from 1.26×to 3.69×were achieved for the best alternative using a MicroBlaze processor with local memory.
机译:将微处理器中运行的指令映射到充当协处理器的可重配置处理单元(RPU)的能力,可以加速应用程序的运行时并确保代码以及可能的性能可移植性。在这项工作中,我们专注于将基于循环的指令跟踪(称为Megablock)映射到RPU。所提出的方法考虑了离线分区和映射阶段,而不忽略它们将来的运行时适用性。我们提供了一个工具链,可以从MicroBlaze指令跟踪中自动提取特定的基于跟踪的循环,称为Megablock,并生成用于执行这些循环的RPU。我们的硬件基础架构能够在运行时透明地将循环执行从微处理器转移到RPU,而无需更改可执行二进制文件。工具链和系统完全可以运行。使用一组15个应用程序内核对系统的三种FPGA实现(所使用的硬件接口有所不同)进行了测试和评估。使用具有本地内存的MicroBlaze处理器,可以实现1.26倍至3.69倍的加速,以获得最佳替代方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号