IEEE Transactions on Neural Networks and Learning Systems

A Learning Framework for n-Bit Quantized Neural Networks Toward FPGAs

Abstract

The quantized neural network (QNN) is an efficient approach to network compression and is well suited to implementation on field-programmable gate arrays (FPGAs). This article proposes a novel learning framework for n-bit QNNs whose weights are constrained to powers of two. To solve the gradient vanishing problem, we propose a reconstructed gradient function for QNNs in the back-propagation algorithm that directly computes the real gradient rather than estimating an approximate gradient of the expected loss. We also propose a novel QNN structure named n-BQ-NN, which replaces multiply operations with shift operations and is better suited to inference on FPGAs. Furthermore, we design a shift vector processing element (SVPE) array that replaces all 16-bit multiplications in the convolution operation with SHIFT operations on FPGAs. We carry out comparative experiments to evaluate our framework. The experimental results show that the quantized ResNet, DenseNet, and AlexNet models produced by our learning framework achieve almost the same accuracies as the original full-precision models. Moreover, when our n-BQ-NN is trained from scratch with our learning framework, it achieves state-of-the-art results compared with typical low-precision QNNs. Experiments on the Xilinx ZCU102 platform show that our n-BQ-NN runs inference 2.9 times faster with our SVPE than with the vector processing element (VPE). Because the SHIFT operation in our SVPE array consumes no digital signal processing (DSP) resources on FPGAs, the experiments also show that the SVPE array reduces average energy consumption to 68.7% of that of the 16-bit VPE array.
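To illustrate the power-of-two weight constraint described above, the sketch below rounds each weight to the nearest signed power of two. This is a minimal illustration, not the paper's method: the helper name quantize_pow2, the n_bits parameter, and the exponent-clipping scheme are assumptions, and the paper's reconstructed gradient function is not reproduced here.

import numpy as np

def quantize_pow2(w, n_bits=4):
    # Round each weight to the nearest signed power of two.
    # Hypothetical scheme: one sign bit plus an (n_bits - 1)-bit exponent;
    # the paper's actual n-bit codebook may differ.
    sign = np.sign(w)
    mag = np.maximum(np.abs(w), 1e-12)        # avoid log2(0)
    exp = np.round(np.log2(mag))
    lo, hi = -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1
    exp = np.clip(exp, lo, hi)                # keep exponents representable
    return sign * (2.0 ** exp)

w = np.array([0.3, -0.7, 1.9, 0.051])
print(quantize_pow2(w))                       # [ 0.25 -0.5  2.  0.0625]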
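The SVPE idea rests on the identity x * 2^k = x << k for integer (fixed-point) values, which is why power-of-two weights let shifts replace multiplies in the convolution datapath. The snippet below sketches that identity in Python; mul_by_pow2_shift is a hypothetical name, and the paper's actual datapath is 16-bit FPGA hardware, not Python integers.

def mul_by_pow2_shift(x_fixed, exp):
    # Multiply an integer activation by 2**exp using a bit shift,
    # mirroring how power-of-two weights remove multipliers on FPGAs.
    # Negative exponents become right shifts, which truncate low bits
    # exactly as fixed-point hardware would.
    return x_fixed << exp if exp >= 0 else x_fixed >> -exp

assert mul_by_pow2_shift(44, 3) == 44 * 8    # left shift = multiply
assert mul_by_pow2_shift(44, -2) == 44 // 4  # right shift = divide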

