首页> 外文会议>IEEE International Conference on ASIC >A high utilization FPGA-based accelerator for variable-scale convolutional neural network
【24h】

A high utilization FPGA-based accelerator for variable-scale convolutional neural network

机译:基于FPGA的高效率可变尺度卷积神经网络加速器

获取原文
获取外文期刊封面目录资料

摘要

Convolutional Neural Network (CNN) plays an essential role in computer vision applications for high classification accuracy and robust generalization capability. In recent years, various GPU-based or application-specific hardware approaches have been proposed to accelerate CNN computations. However, for variable-scale CNNs, the utilization of DSP on chip is not able to achieve very high due to the boundary of image. In this paper, we propose an optimization framework to solve boundary problem and connect our accelerator with ARM processors and DDR4 memory through dual Advanced eXtensible Interface (AXI) bus. Each port is capable of a peak throughout of 1.6 GB/s in full duplex. The accelerator has the ability to perform 160 G-op/s at peak and achieve 96% computing resource utilization.
机译:卷积神经网络(CNN)在计算机视觉应用中起着重要作用,以实现高分类精度和强大的泛化能力。近年来,已提出了各种基于GPU或特定于应用程序的硬件方法来加速CNN计算。但是,对于可变规模的CNN,由于图像的边界,片上DSP的利用率无法达到很高。在本文中,我们提出了一个优化框架来解决边界问题,并通过双高级可扩展接口(AXI)总线将加速器与ARM处理器和DDR4内存连接。每个端口在全双工模式下的峰值峰值可达1.6 GB / s。该加速器能够在峰值时执行160 G-op / s的速度,并实现96%的计算资源利用率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号