2019 56th ACM/IEEE Design Automation Conference

FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference



Abstract

To improve the throughput and energy efficiency of Deep Neural Networks (DNNs) on customized hardware, lightweight neural networks constrain the weights of DNNs to be a limited combination (denoted as $k \in \{1, 2\}$) of powers of 2. In such networks, the multiply-accumulate operation can be replaced with a single shift operation, or two shifts and an add operation. To provide even more design flexibility, the $k$ for each convolutional filter can be optimally chosen instead of being fixed for every filter. In this paper, we formulate the selection of $k$ to be differentiable, and describe model training for determining $k$-based weights on a per-filter basis. Over 46 FPGA-design experiments involving eight configurations and four data sets reveal that lightweight neural networks with a flexible $k$ value (dubbed FLightNNs) fully utilize the hardware resources on Field Programmable Gate Arrays (FPGAs). Our experimental results show that FLightNNs can achieve a $2\times$ speedup when compared to lightweight NNs with $k = 2$, with only 0.1% accuracy degradation. Compared to a 4-bit fixed-point quantization, FLightNNs achieve higher accuracy and up to $2\times$ inference speedup, due to their lightweight shift operations. In addition, our experiments also demonstrate that FLightNNs can achieve higher computational energy efficiency for ASIC implementation.

CCS CONCEPTS: • Computing methodologies $\rightarrow$ Machine learning; • Hardware $\rightarrow$ Electronic design automation.
