首页> 外文会议>IEEE International Conference on ASIC >A high utilization FPGA-based accelerator for variable-scale convolutional neural network

【24h】

A high utilization FPGA-based accelerator for variable-scale convolutional neural network

机译：基于FPGA的高效率可变尺度卷积神经网络加速器

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Convolutional Neural Network (CNN) plays an essential role in computer vision applications for high classification accuracy and robust generalization capability. In recent years, various GPU-based or application-specific hardware approaches have been proposed to accelerate CNN computations. However, for variable-scale CNNs, the utilization of DSP on chip is not able to achieve very high due to the boundary of image. In this paper, we propose an optimization framework to solve boundary problem and connect our accelerator with ARM processors and DDR4 memory through dual Advanced eXtensible Interface (AXI) bus. Each port is capable of a peak throughout of 1.6 GB/s in full duplex. The accelerator has the ability to perform 160 G-op/s at peak and achieve 96% computing resource utilization.

机译：卷积神经网络（CNN）在计算机视觉应用中起着重要作用，以实现高分类精度和强大的泛化能力。近年来，已提出了各种基于GPU或特定于应用程序的硬件方法来加速CNN计算。但是，对于可变规模的CNN，由于图像的边界，片上DSP的利用率无法达到很高。在本文中，我们提出了一个优化框架来解决边界问题，并通过双高级可扩展接口（AXI）总线将加速器与ARM处理器和DDR4内存连接。每个端口在全双工模式下的峰值峰值可达1.6 GB / s。该加速器能够在峰值时执行160 G-op / s的速度，并实现96％的计算资源利用率。

著录项

来源
《IEEE International Conference on ASIC》|2017年|944-947|共4页
会议地点
作者
Xin Li; Yujie Cai; Jun Han; Xiaoyang Zeng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Convolution; Optimization; Random access memory; Kernel; Shift registers; Acceleration;

机译：卷积;优化;随机存取存储器;内核;移位寄存器;加速;

相似文献

外文文献
中文文献
专利

1. A survey of FPGA-based accelerators for convolutional neural networks [J] . Neural computing & applications . 2020,第4期

机译：基于FPGA的卷积神经网络的加速器调查
2. FFConv: An FPGA-based Accelerator for Fast Convolution Layers in Convolutional Neural Networks [J] . AFZAL AHMAD, MUHAMMAD ADEEL PASHA ACM Transactions on Embedded Computing Systems . 2020,第2期

机译：FFCONV：卷积神经网络中的快速卷积层的基于FPGA的加速器
3. WinoNN: Optimizing FPGA-Based Convolutional Neural Network Accelerators Using Sparse Winograd Algorithm [J] . Wang Xuan, Wang Chao, Cao Jing, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems . 2020,第11期

机译：WINONN：使用稀疏Winograd算法优化基于FPGA的卷积神经网络加速器
4. A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network [C] . Xin Li, Yujie Cai, Jun Han, IEEE International Conference on ASIC . 2017

机译：基于高利用FPGA的可变型卷积神经网络的加速器
5. FPGA-based Accelerators for Convolutional Neural Networks on Embedded Devices [D] . Perera Miro, Jordi. 2020

机译：基于FPGA的嵌入式设备卷积神经网络的加速器
6. 3D Convolutional Neural Networks Initialized from Pretrained 2D Convolutional Neural Networks for Classification of Industrial Parts [O] . Ibon Merino, Jon Azpiazu, Anthony Remazeilles, 2021

机译：3D卷积神经网络从佩带的2D卷积神经网络初始化用于工业部件的分类
7. Improving Memory Utilization in Convolutional Neural Network Accelerators [O] . Petar Jokic, Stephane Emery, Luca Benini 2020

机译：提高卷积神经网络加速器中的内存利用

A high utilization FPGA-based accelerator for variable-scale convolutional neural network

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅