We propose BinaryRelax, a simple two-phase algorithm for training deep neural networks with quantized weights. The set constraint that characterizes the quantization of weights is not imposed until the late stage of training, and a sequence of pseudo quantized weights is maintained. Specifically, we relax the hard constraint into a continuous regularizer via the Moreau envelope, which turns out to be the squared Euclidean distance to the set of quantized weights. The pseudo quantized weights are obtained by linearly interpolating between the float weights and their quantizations. A continuation strategy is adopted to push the weights toward the quantized state by gradually increasing the regularization parameter. In the second phase, an exact quantization scheme with a small learning rate is invoked to guarantee fully quantized weights. We test BinaryRelax on the benchmark CIFAR-10 and CIFAR-100 color image datasets to demonstrate the superiority of the relaxed quantization approach and the improved accuracy over state-of-the-art training methods. Finally, we prove the convergence of BinaryRelax under an approximate orthogonality condition.
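The relaxation step described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the binary quantizer here uses a per-layer scale equal to the mean absolute weight, and the function names (`binary_quantize`, `binaryrelax_step`) and the choice of continuation schedule are assumptions for illustration.

```python
def binary_quantize(w):
    # Project float weights onto a set of scaled binary weights
    # {+s, -s}: keep the sign of each weight and use the mean
    # absolute value as the (illustrative) common scale s.
    s = sum(abs(x) for x in w) / len(w)
    return [s if x >= 0 else -s for x in w]

def binaryrelax_step(w_float, lam):
    # Moreau-envelope relaxation of the hard set constraint: the
    # proximal point of lam/2 * (squared distance to the quantized
    # set) is a linear interpolation between the float weights and
    # their quantization. lam = 0 returns the float weights; as
    # lam grows, the pseudo quantized weights approach the fully
    # quantized state (continuation strategy).
    q = binary_quantize(w_float)
    return [(x + lam * qx) / (1.0 + lam) for x, qx in zip(w_float, q)]
```

During phase one, `lam` would be increased gradually across training iterations; phase two corresponds to dropping the interpolation and using the exact quantization directly.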