IEEE/CVF Conference on Computer Vision and Pattern Recognition

Towards Effective Low-bitwidth Convolutional Neural Networks

Abstract

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations. Optimizing a low-precision network is very challenging since the training process can easily get trapped in a poor local minimum, which results in substantial accuracy loss. To mitigate this problem, we propose three simple yet effective approaches to improve network training. First, we propose a two-stage optimization strategy to progressively find good local minima. Specifically, we first optimize a network with quantized weights and only then with quantized activations. This is in contrast to traditional methods, which optimize both simultaneously. Second, in a similar spirit to the first method, we propose a second progressive optimization approach that gradually decreases the bitwidth from high precision to low precision during training. Third, we adopt a novel learning scheme to jointly train a full-precision model alongside the low-precision one. By doing so, the full-precision model provides hints to guide the low-precision model training. Extensive experiments on CIFAR-100 and ImageNet show the effectiveness of the proposed methods. Notably, using our methods to train a 4-bit precision network leads to no performance decrease compared with its full-precision counterpart on standard network architectures (AlexNet and ResNet-50).
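
The abstract gives no implementation details, so the following is a minimal PyTorch-style sketch of the first idea (two-stage optimization: quantize weights first, then activations), assuming a DoReFa-style uniform quantizer trained through a straight-through estimator. All names here (`QuantizeSTE`, `QConv2d`, `quantize_acts`, the bitwidth `k`) are hypothetical illustrations, not the authors' code.

```python
# A minimal sketch (not the authors' released code) of two-stage low-bitwidth
# training: stage 1 quantizes weights only, stage 2 also quantizes activations.
import torch
import torch.nn as nn
import torch.nn.functional as F


class QuantizeSTE(torch.autograd.Function):
    """Uniform k-bit quantizer for inputs in [0, 1]; the backward pass is a
    straight-through estimator (identity gradient), a standard trick for
    training through the non-differentiable rounding step."""

    @staticmethod
    def forward(ctx, x, k):
        n = 2 ** k - 1                      # number of quantization intervals
        return torch.round(x * n) / n

    @staticmethod
    def backward(ctx, grad_out):
        return grad_out, None               # identity w.r.t. x, no grad for k


class QConv2d(nn.Conv2d):
    """Convolution with k-bit weights and, optionally, k-bit activations."""

    def __init__(self, *args, k=4, quantize_acts=False, **kwargs):
        super().__init__(*args, **kwargs)
        self.k = k
        self.quantize_acts = quantize_acts  # stage 1: False, stage 2: True

    def forward(self, x):
        # DoReFa-style weight quantization: squash weights into [0, 1],
        # quantize, then rescale back to [-1, 1].
        w = torch.tanh(self.weight)
        w = w / (2 * w.abs().max()) + 0.5
        w_q = 2 * QuantizeSTE.apply(w, self.k) - 1
        if self.quantize_acts:
            # Assumes activations are clipped to [0, 1] (e.g. a bounded ReLU).
            x = QuantizeSTE.apply(torch.clamp(x, 0.0, 1.0), self.k)
        return F.conv2d(x, w_q, self.bias, self.stride,
                        self.padding, self.dilation, self.groups)


# Stage 1: train with low-precision weights but full-precision activations.
model = nn.Sequential(QConv2d(3, 8, 3, padding=1, k=4), nn.ReLU())
# ... run the usual training loop here ...

# Stage 2: switch on activation quantization and fine-tune from stage 1.
for m in model.modules():
    if isinstance(m, QConv2d):
        m.quantize_acts = True
# ... fine-tune with the same loop ...
```

Under the same assumptions, the second method would repeat stage 2 while stepping `k` down (e.g. 8 → 4 → 2), each time initializing from the previous solution; the third method would add a hint term such as `loss = task_loss + mu * F.mse_loss(lp_feat, fp_feat.detach())`, pulling the low-precision model's features toward those of a full-precision teacher.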