Published in: Journal of Signal Processing Systems for Signal, Image, and Video Technology

Evaluation of Static Mapping for Dynamic Space-Shared Multi-task Processing on FPGAs

Abstract

Although FPGAs are already used in cloud ecosystems, achieving high compute density remains extremely challenging when heterogeneous tasks are mapped onto shared resources at runtime. This work addresses the problem by treating the FPGA resource as a service and combining high-level multi-task processing, design space exploration, and static off-line partitioning to map heterogeneous tasks onto the FPGA more efficiently. In addition, a new, comprehensive runtime functional simulator is used to evaluate the effect of various spatial and temporal constraints on both the existing and the new approaches as system design parameters are varied. A comprehensive suite of real high performance computing tasks was implemented on a Nallatech 385 FPGA card, and the results show that our approach provides on average 2.9x and 2.3x higher system throughput for compute-intensive and mixed-intensity tasks, respectively, but 0.2x lower throughput for memory-intensive tasks because of external memory access latency and bandwidth limitations. The work is extended with a novel scheduling scheme that improves the temporal utilization of resources under the proposed approach. Additional results for large queues of mixed-intensity tasks (compute and memory) show that the proposed partitioning and scheduling approach can provide more than 3x system speedup over previous schemes.
