International Conference on Artificial Neural Networks

IQNN: Training Quantized Neural Networks with Iterative Optimizations


Abstract

Quantized Neural Networks (QNNs) use low-bitwidth numbers to represent parameters and intermediate results. Lowering the bitwidth saves storage space and allows bitwise operations to be exploited to speed up computation. However, QNNs often have lower prediction accuracies than their floating-point counterparts, due to the extra quantization errors. In this paper, we propose a quantization algorithm that iteratively solves for the optimal scaling factor during every forward pass, which significantly reduces quantization errors. Moreover, we propose a novel initialization method for the iterative quantization, which speeds up convergence and further reduces quantization errors. Overall, our method improves the prediction accuracy of QNNs at no extra cost for inference. Experiments confirm the efficacy of our method in the quantization of AlexNet, GoogLeNet and ResNet. In particular, we are able to train a GoogLeNet with 4-bit weights and activations that reaches 11.4% top-5 single-crop error on the ImageNet dataset, outperforming state-of-the-art QNNs. The code will be available online.
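The abstract does not spell out the update rules, but one common way to realize an iterative search for the optimal scaling factor is to alternate between quantizing with a fixed scale and re-fitting the scale in closed form. The sketch below illustrates that general idea under these assumptions; function and variable names are illustrative and not taken from the paper.

```python
import numpy as np

def iterative_quantize(w, bits=4, iters=5):
    """Illustrative alternating optimization of a scaling factor alpha and
    integer levels q so that alpha * q approximates the tensor w.
    NOTE: a minimal sketch of iterative scale fitting, not the paper's exact algorithm."""
    qmax = 2 ** (bits - 1) - 1           # symmetric signed range, e.g. [-7, 7] for 4 bits
    alpha = np.abs(w).max() / qmax        # simple initialization of the scale
    for _ in range(iters):
        # Step 1: with alpha fixed, round to the nearest representable level.
        q = np.clip(np.round(w / alpha), -qmax, qmax)
        # Step 2: with q fixed, the least-squares optimal scale has a closed form.
        denom = np.sum(q * q)
        if denom == 0:
            break
        alpha = np.sum(w * q) / denom
    return alpha, q

# Example: quantize a random weight tensor to 4 bits and check reconstruction error.
w = np.random.randn(256, 64).astype(np.float32)
alpha, q = iterative_quantize(w, bits=4)
print("reconstruction MSE:", np.mean((w - alpha * q) ** 2))
```

In such a scheme, each iteration can only decrease the reconstruction error, and a better initialization of the scale (as the paper proposes) reduces the number of iterations needed per forward pass.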
