首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Hardware Implementation on FPGA for Task-Level Parallel Dataflow Execution Engine
【24h】

Hardware Implementation on FPGA for Task-Level Parallel Dataflow Execution Engine

机译:用于任务级并行数据流执行引擎的FPGA硬件实现

获取原文
获取原文并翻译 | 示例

摘要

Heterogeneous multicore platform has been widely used in various areas to achieve both power efficiency and high performance. However, it poses significant challenges to researchers to uncover more coarse-grained task level parallelization. In order to support automatic task parallel execution, this paper proposes a FPGA implementation of a hardware out-of-order scheduler on heterogeneous multicore platform. The scheduler is capable of exploring potential inter-task dependency, leading to a significant acceleration of dependence-aware applications. With the help of renaming scheme, the task dependencies are detected automatically during execution, and then task-level Write-After-Write (WAW) and Write-After-Read (WAR) dependencies can be eliminated dynamically. We extended the instruction level renaming techniques to perform task-level out-of-order execution, and implemented a prototype on a state-of-art Xilinx Virtex-5 FPGA device. Given the reconfigurable characteristic of FPGA, our scheduler supports changing accelerators at runtime to improve the flexibility. Experimental results demonstrate that our scheduler is efficient at both performance and resources usage.
机译:异构多核平台已广泛应用于各个领域,以实现电源效率和高性能。但是,这给研究人员发现更大的任务级并行化带来了巨大的挑战。为了支持自动任务并行执行,本文提出了一种在异构多核平台上的硬件无序调度程序的FPGA实现。调度程序能够探索潜在的任务间依赖关系,从而极大地加快了依赖关系感知的应用程序的速度。借助重命名方案,可以在执行期间自动检测任务相关性,然后可以动态消除任务级别的写后写(WAW)和写后读(WAR)依赖关系。我们扩展了指令级重命名技术,以执行任务级无序执行,并在最新的Xilinx Virtex-5 FPGA器件上实现了原型。鉴于FPGA的可重配置特性,我们的调度程序支持在运行时更改加速器以提高灵活性。实验结果表明,我们的调度程序在性能和资源使用方面都是高效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号