首页> 外文会议>International Conference on Electron Devices and Solid-State Circuits >Composite FPGA-based Accelerator for Deep Convolutional Neural Networks

【24h】

Composite FPGA-based Accelerator for Deep Convolutional Neural Networks

机译：用于深度卷积神经网络的基于FPGA的复合加速器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Convolutional neural network (CNN) can achieve high prediction accuracy while they built complex models; however, high power consumption, memory demands, and bandwidth resource consumption by the models are creating enormous challenges in their actual deployment. To achieve the rapid prediction capabilities of high-precision models, a modified convolutional neural network model based on the GoogLeNet model has been proposed in this work, and a composite-structure convolutional neural network hardware accelerator compatible with the parallel computing model has been designed. An experimental model based the modified structure has been established and CIFAR-10 dataset was used to evaluate the prediction accuracy. The accelerator achieved 663 FPS peak performance with a 9.06% error rate, and was implemented on a Xilinx VC709. Compared to CPU and GPU, its energy efficiency increased by a factor of 7.1 and 1.9, respectively, achieving a high-precision complex network computing acceleration.

机译：卷积神经网络（CNN）在构建复杂模型时可以实现较高的预测精度;但是，模型的高功耗，内存需求和带宽资源消耗正在其实际部署中带来巨大挑战。为了实现高精度模型的快速预测能力，本文提出了一种基于GoogLeNet模型的改进的卷积神经网络模型，并设计了与并行计算模型兼容的复合结构卷积神经网络硬件加速器。建立了基于改进结构的实验模型，并使用CIFAR-10数据集评估了预测精度。该加速器在Xilinx VC709上实现，具有663 FPS的峰值性能，错误率9.06％。与CPU和GPU相比，其能效分别提高了7.1和1.9倍，实现了高精度的复杂网络计算加速。

著录项

来源
《International Conference on Electron Devices and Solid-State Circuits 》|2019年|1-3|共3页
会议地点
作者
Huan Zhang; Yuan Yang; Yang Xiao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
convolutional neural nets; field programmable gate arrays; low-power electronics; parallel processing;

机译：卷积神经网络现场可编程门阵列低功耗电子并行处理;

相似文献

外文文献
中文文献
专利

1. FPGA-Based Deep Convolutional Neural Network Accelerator Design Techniques for the Handwritten Number Recognizer [J] . Advanced Science Letters . 2018 ,第3期

机译：基于FPGA的深卷积神经网络加速器设计技术识别器
2. A survey of FPGA-based accelerators for convolutional neural networks [J] . Neural computing & applications . 2020 ,第4期

机译：基于FPGA的卷积神经网络的加速器调查
3. FFConv: An FPGA-based Accelerator for Fast Convolution Layers in Convolutional Neural Networks [J] . AFZAL AHMAD, MUHAMMAD ADEEL PASHA ACM Transactions on Embedded Computing Systems . 2020 ,第2期

机译：FFCONV：卷积神经网络中的快速卷积层的基于FPGA的加速器
4. Composite FPGA-based Accelerator for Deep Convolutional Neural Networks [C] . Huan Zhang, Yuan Yang, Yang Xiao International Conference on Electron Devices and Solid-State Circuits . 2019

机译：基于复合FPGA的深度卷积神经网络的加速器
5. FPGA-based Accelerators for Convolutional Neural Networks on Embedded Devices [D] . Perera Miro, Jordi. 2020

机译：基于FPGA的嵌入式设备卷积神经网络的加速器
6. Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network [O] . Seung Seog Han, Gyeong Hun Park, Woohyung Lim, -1

机译：深度神经网络在灰指甲诊断中显示出与皮肤科医生相当且通常优于皮肤病的性能：通过基于区域的卷积深度神经网络自动构建灰指甲数据集
7. FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling [O] . Masayuki Shimoda, Youki Sada, Hiroki Nakahara 2021

机译：基于FPGA的层间流水线加速器，用于滤波器的重量平衡的稀疏完全卷积网络，具有重叠的平铺

Composite FPGA-based Accelerator for Deep Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅