首页> 外文期刊>Microprocessors and microsystems >A configurable multiplex data transfer model for asynchronous and heterogeneous FPGA accelerators on single DMA device
【24h】

A configurable multiplex data transfer model for asynchronous and heterogeneous FPGA accelerators on single DMA device

机译:单个DMA设备异步和异构FPGA加速器的可配置多路复用数据传输模型

获取原文
获取原文并翻译 | 示例
       

摘要

To reduce DMA utilization for multiple algorithm IPs on FPGA, a channel configurable and multiplex DMA device (CMDMA) is proposed for asynchronous and heterogeneous algorithm IPs. Firstly, we abstract the entities and data-flow in CMDMA system with a formal description for function definition and work-flow analysis. Then based on the functions and work-flow, we design and implement a prototype of CMDMA, which includes CMDMA software driver (SW) and hardware circuits (HW) of one DMA IP, a configurable input switch (CISwitch), algorithm IPs, and an asynchronous output switch (AOSwitch). The configurable function of CMDMA is implemented by CISwitch through a configuration port in HW-level, and a configurable Round-Robin (CRR) algorithm is proposed to implement channel and input data schedule in SW-level. For output, a channel distinguishable output buffer (ChnDistBuf) is proposed, which is able to deliver channel ID and data size to SW earlier than the end time of an algorithm IP. With a double interrupt coordination method of both ChnDistBuf and algorithm IPs, CMDMA is able to successively store complete output data from different algorithm IPs. With a double interrupt coordination method of both ChnDistBuf and algorithm IPs, CMDMA is able to successively store complete output data from different algorithm IPs. The experiments based on 4 heterogeneous matrix multiplication algorithm IPs on Xilinx Zynq platform show that CMDMA is able to improve about 8% 29% average algorithm acceleration rates on single algorithm IP compared to the exclusive method that one DMA works for one algorithm IP only, and it is able to increase about 10-40 MB/s and 5-15 MB/s of DMA input and output data throughput with multiple algorithm IPs running in parallel. Moreover, the extended LUT and FF resources in CMDMA are 756 and 1219 , both of which are about 1% of Zynq platform. Besides, in a double CNN algorithm IPs test on Mnist application, an enhanced function of data broadcasting in CMDMA is able to improve 4 s than the system with 4 exclusive DMA running in parallel, meanwhile reduce 3 DMA utilization and 0. 03 W power consumption. (c) 2020 Elsevier B.V. All rights reserved.
机译:为了降低FPGA上多算法IP的DMA利用,提出了一种通道可配置和多路复用DMA设备(CMDMA),用于异步和异构算法IP。首先,我们摘要CMDMA系统中的实体和数据流,具有功能定义和工作流程分析的正式描述。然后基于功能和工作流程,我们设计和实现CMDMA的原型,包括一个DMA IP的CMDMA软件驱动器(SW)和硬件电路(HW),可配置的输入开关(CISWitch),算法IP和异步输出开关(AOSWitch)。 CMDMA的可配置功能由CISwitch通过HW级的配置端口实现,并且提出了一种可配置的循环(CRR)算法来实现SW级中的信道和输入数据计划。对于输出,提出了一种通道可区分的输出缓冲器(CHNDISTBUF),其能够将信道ID和数据大小转换为比算法IP的结束时间的SW。对于CHNDistBuf和算法IP的双中断协调方法,CMDMA能够连续存储来自不同算法IP的完整输出数据。对于CHNDistBuf和算法IP的双中断协调方法,CMDMA能够连续存储来自不同算法IP的完整输出数据。基于4个异构矩阵乘法算法IPS上的实验Xilinx Zynq平台,显示了CMDMA能够与单算法IP上的约8%29%的平均算法加速率相比,一个DMA仅用于一种算法IP的独占方法,它能够增加大约10-40 MB / s和5-15 MB / s的DMA输入和输出数据吞吐量,并与多个算法IP并行运行。此外,CMDMA中的扩展LUT和FF资源为756和1219,其中两者都是Zynq平台的1%。此外,在MNIST应用的双CNN算法IPS测试中,CMDMA中的数据广播的增强功能能够与并行运行4个独占DMA的系统来改善4 S,同时减少3 DMA利用率和0. 03 W功耗。 。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Microprocessors and microsystems》 |2020年第9期|103174.1-103174.21|共21页
  • 作者单位

    Beijing Univ Technol Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China;

    Beijing Univ Technol Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China;

    Beijing Univ Technol Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China;

    Beijing Univ Technol Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China;

    Beijing Univ Technol Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    DMA; Multiplex; Switch; System architecture; FPGA;

    机译:DMA;多路复用;开关;系统架构;FPGA;
  • 入库时间 2022-08-18 21:28:38

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号