Journal of Circuits, Systems and Computers

Optimizing Convolutional Neural Network Accelerator on Low-Cost FPGA


Abstract

Convolutional neural networks (CNNs) are among the most promising algorithms, outperforming traditional methods in terms of accuracy on classification tasks. However, several CNNs, such as VGG, demand huge amounts of computation in their convolutional layers. Many accelerators implemented on powerful FPGAs have been introduced to address this problem. In this paper, we present a VGG-based accelerator optimized for a low-cost FPGA. To conserve the FPGA's logic elements and memory, we propose a dedicated input buffer that maximizes data reuse. In addition, we design a low-resource processing engine with an optimal number of Multiply-Accumulate (MAC) units. In our experiments, we run VGG16 inference to evaluate the performance of the accelerator and achieve a throughput of 38.8 GOPS at a clock speed of 150 MHz on an Intel Cyclone V SX SoC. The experimental results show that our design outperforms previous works in terms of resource efficiency.
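The abstract does not detail the buffer design, but the data-reuse idea it describes — reading each input row from memory once and reusing it across all kernel rows it overlaps — can be illustrated with a software sketch of a line-buffered 3×3 convolution. The function below is a hypothetical model for illustration only, not the paper's actual architecture:

```python
import numpy as np

def conv3x3_line_buffer(image, kernel):
    """3x3 'valid' convolution using a sliding line buffer.

    Models the data-reuse principle of a dedicated input buffer:
    each input row is fetched from memory exactly once, then reused
    for every 3x3 window (and kernel row) it participates in.
    Returns the output feature map and the number of row fetches.
    """
    H, W = image.shape
    line_buf = np.zeros((3, W))        # holds the 3 most recent input rows
    out = np.zeros((H - 2, W - 2))
    row_fetches = 0                    # external-memory reads, one per row
    for r in range(H):
        line_buf[:-1] = line_buf[1:]   # shift buffer up by one row
        line_buf[-1] = image[r]        # fetch exactly one new row
        row_fetches += 1
        if r >= 2:                     # buffer full: emit one output row
            for c in range(W - 2):
                window = line_buf[:, c:c + 3]
                # 9 multiply-accumulate (MAC) operations per output pixel
                out[r - 2, c] = np.sum(window * kernel)
    return out, row_fetches
```

With this scheme the number of memory fetches equals the number of input rows, whereas a naive implementation re-reads each row up to three times (once per overlapping kernel row) — the kind of saving a dedicated input buffer provides on an FPGA with limited on-chip memory.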
