Transparent Runtime Migration of Loop-Based Traces of Processor Instructions to Reconfigurable Processing Units

João Bispo; Nuno Paulino; João M. P. Cardoso; João Canas Ferreira

Abstract

The ability to map instructions running in a microprocessor to a reconfigurable processing unit (RPU), acting as a coprocessor, enables the runtime acceleration of applications and ensures code and possibly performance portability. In this work, we focus on the mapping of loop-based instruction traces (called Megablocks) to RPUs. The proposed approach considers offline partitioning and mapping stages without ignoring their future runtime applicability. We present a toolchain that automatically extracts specific trace-based loops, called Megablocks, from MicroBlaze instruction traces and generates an RPU for executing those loops. Our hardware infrastructure is able to move loop execution from the microprocessor to the RPU transparently, at runtime, and without changing the executable binaries. The toolchain and the system are fully operational. Three FPGA implementations of the system, differing in the hardware interfaces used, were tested and evaluated with a set of 15 application kernels. Speedups ranging from 1.26× to 3.69× were achieved for the best alternative using a MicroBlaze processor with local memory.
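
The abstract does not say how Megablocks are identified, so the following is only a minimal sketch of the general idea of finding a loop as a repeating sequence of instruction addresses in a trace. It is not the authors' algorithm; the function name `find_repeating_pattern` and the parameters `max_size` and `min_repeats` are hypothetical and chosen for illustration.

```python
from typing import List, Optional, Tuple


def find_repeating_pattern(
    trace: List[int], max_size: int = 32, min_repeats: int = 3
) -> Optional[Tuple[int, List[int], int]]:
    """Illustrative only (assumption, not the paper's method).

    Scan a list of instruction addresses and return
    (start_index, pattern, repeats) for the first address sequence that
    repeats back-to-back at least `min_repeats` times, or None.
    """
    n = len(trace)
    for start in range(n):
        for size in range(1, max_size + 1):
            # Not enough trace left for `min_repeats` copies of this size.
            if start + size * min_repeats > n:
                break
            pattern = trace[start:start + size]
            repeats = 1
            pos = start + size
            # Count consecutive copies of the candidate pattern.
            while trace[pos:pos + size] == pattern:
                repeats += 1
                pos += size
            if repeats >= min_repeats:
                return start, pattern, repeats
    return None


# Hypothetical trace: two setup instructions, a 3-instruction loop body
# executed four times, then one instruction after the loop.
trace = [0x100, 0x104] + [0x108, 0x10C, 0x110] * 4 + [0x114]
result = find_repeating_pattern(trace)
```

With this hypothetical trace, `result` holds the start index of the loop, its three addresses, and the number of consecutive iterations. A real toolchain would also need to handle branches that leave the loop and the data flow between instructions, which this sketch ignores.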

Bibliographic details

Source: International Journal of Reconfigurable Computing, 2013, issue 2, 20 pages
Authors: João Bispo; Nuno Paulino; João M. P. Cardoso; João Canas Ferreira
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
