International Symposium on Advanced Parallel Processing Technologies

Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators



Abstract

Convolutional Neural Networks (CNNs) have been extensively employed in research fields including multimedia recognition, computer vision, etc. Various FPGA-based accelerators for deep CNNs have been proposed to achieve high energy efficiency. For some FPGA-based CNN accelerators in embedded systems, such as UAVs, IoT devices, and wearables, overall performance is tightly bounded by the limited data bandwidth to the on-board DRAM. In this paper, we argue that it is feasible to overcome the bandwidth bottleneck using data compression techniques. We propose an effective roofline model to explore the design tradeoff between computation logic and data bandwidth after applying data compression to the parameters of CNNs. As case studies, we implement a decompression module and a CNN accelerator on a single Xilinx VC707 FPGA board with two different compression/decompression algorithms. Under a scenario with limited data bandwidth, our implementation outperforms designs using previous methods by 3.2× in overall performance.
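The tradeoff the abstract describes can be sketched with the classic roofline formula: attainable throughput is the minimum of the accelerator's peak compute rate and the product of operational intensity (operations per byte fetched from DRAM) and memory bandwidth. Compressing parameters reduces the bytes fetched per operation, which raises the effective operational intensity. The numbers below are purely illustrative assumptions, not figures from the paper:

```python
def roofline(peak_gflops, bandwidth_gbs, ops_per_byte):
    """Attainable performance (GFLOP/s) under the roofline model:
    the lesser of the compute roof and the bandwidth-limited slope."""
    return min(peak_gflops, ops_per_byte * bandwidth_gbs)

# Hypothetical accelerator parameters for illustration only.
peak = 500.0       # GFLOP/s of on-chip compute logic
bw = 4.0           # GB/s of effective DRAM bandwidth (bandwidth-limited scenario)
intensity = 20.0   # ops per byte of uncompressed parameter traffic

baseline = roofline(peak, bw, intensity)            # bandwidth-bound: 80.0
# A 3.2x parameter compression ratio means 3.2x fewer bytes fetched
# per operation, so operational intensity scales by the same factor.
compressed = roofline(peak, bw, intensity * 3.2)    # 256.0, still under the roof
```

In the bandwidth-bound region the speedup tracks the compression ratio directly (256 / 80 = 3.2×ۘ here), which is the effect the paper exploits; once the design crosses the ridge point, further compression yields no gain and the roofline model instead argues for spending resources on compute logic.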
