...
首页> 外文期刊>Microprocessors and microsystems >IMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators
【24h】

IMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators

机译:IMORC:用于实现高性能可重构FPGA加速器的基础架构和体系结构模板

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The design, implementation and optimization of FPGA accelerators is a challenging task, especially when the accelerator comprises multiple compute cores distributed across CPU and FPGA resources and memories and exhibits data-dependent runtime behavior. In order to simplify the development of FPGA accelerators we propose IMORC, an infrastructure and architecture template that helps raising the level of abstraction. The IMORC development flow bases on a modeling technique for visualizing an application's communication demand and an architecture template that aids the developer in implementing the design. The architectural template consists of a versatile on-chip interconnect with asynchronous FIFOs and bitwidth conversion placed into the communication links, a performance monitoring infrastructure for collecting performance information during runtime and a set of generic infrastructure cores which are frequently needed in accelerator designs. We demonstrate the usefulness of the IMORC development flow by means of the case study of accelerating the kth nearest neighbor thinning problem, where IMORC greatly helps us in understanding the communication demand and in implementing the application. With the integrated performance monitoring infrastructure, we gain insights into the data-dependent behavior of the accelerator that helps us in identifying bottlenecks and optimizing the accelerator to achieve a speedup of 10x to 40x over an optimized CPU implementation.
机译:FPGA加速器的设计,实现和优化是一项艰巨的任务,尤其是当加速器包括分布在CPU和FPGA资源和内存中的多个计算核心并表现出与数据相关的运行时行为时。为了简化FPGA加速器的开发,我们提出了IMORC,这是一种有助于提高抽象水平的基础架构和体系结构模板。 IMORC开发流程基于用于可视化应用程序通信需求的建模技术和可帮助开发人员实施设计的体系结构模板。该架构模板包括通用的片上互连以及异步FIFO和位宽转换(放置在通信链路中),用于在运行时收集性能信息的性能监视基础结构,以及加速器设计中经常需要的一组通用基础结构核心。通过加速第k个最近邻稀疏问题的案例研究,我们证明了IMORC开发流程的有用性,其中IMORC在很大程度上帮助我们理解了通信需求并实现了应用。借助集成的性能监视基础结构,我们可以深入了解加速器的数据相关行为,从而帮助我们识别瓶颈并优化加速器,从而使优化的CPU实施速度提高10倍至40倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号