IEEE International Conference on Acoustics, Speech and Signal Processing

Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers


Abstract

The high memory consumption and computational cost of recurrent neural network language models (RNNLMs) limit their wider application on resource-constrained devices. In recent years, neural network quantization techniques capable of producing extremely low-bit compression, for example binarized RNNLMs, have attracted increasing research interest. Directly training quantized neural networks is difficult. By formulating quantized RNNLM training as an optimization problem, this paper presents a novel method to train quantized RNNLMs from scratch using alternating direction methods of multipliers (ADMM). This method can also flexibly adjust the trade-off between the compression rate and model performance using tied low-bit quantization tables. Experiments on two tasks, Penn Treebank (PTB) and Switchboard (SWBD), suggest that the proposed ADMM quantization achieved model size compression factors of up to 31 times over the full-precision baseline RNNLMs. Model training also converged up to 5 times faster than the baseline binarized RNNLM quantization.
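
To make the ADMM formulation mentioned in the abstract concrete, the sketch below illustrates the generic ADMM quantization scheme: full-precision weights are coupled to an auxiliary quantized copy through an augmented-Lagrangian penalty, and the weight update, quantization projection, and dual update are alternated. This is a minimal, hypothetical illustration under assumed names (project_to_lowbit, admm_quantize, a toy quadratic loss, and a fixed codebook), not the paper's tied quantization tables or RNNLM training setup.

import numpy as np

def project_to_lowbit(w, codebook):
    """G-update: project each weight onto its nearest codebook entry."""
    codebook = np.asarray(codebook)
    idx = np.argmin(np.abs(w[..., None] - codebook), axis=-1)
    return codebook[idx]

def admm_quantize(W, loss_grad, codebook, rho=1e-3, lr=1e-2, steps=100):
    """Alternate the W-, G-, and dual updates; return the quantized weights G."""
    G = project_to_lowbit(W, codebook)   # auxiliary quantized variable
    u = np.zeros_like(W)                 # scaled dual variable
    for _ in range(steps):
        # W-update: gradient step on loss(W) + (rho/2) * ||W - G + u||^2
        grad = loss_grad(W) + rho * (W - G + u)
        W = W - lr * grad
        # G-update: project (W + u) onto the low-bit codebook
        G = project_to_lowbit(W + u, codebook)
        # Dual update: accumulate the residual between W and G
        u = u + W - G
    return G

if __name__ == "__main__":
    # Toy usage: quantize the minimizer of a quadratic "loss" to a binary codebook.
    target = np.array([0.9, -1.2, 0.3, -0.4])
    loss_grad = lambda W: W - target          # gradient of 0.5 * ||W - target||^2
    W0 = np.random.randn(4) * 0.1
    print(admm_quantize(W0, loss_grad, codebook=[-1.0, 1.0]))

In this generic scheme the network loss is only ever optimized over the full-precision weights, while the discrete quantization constraint is handled by the projection step, which is what allows low-bit models to be trained from scratch rather than quantized after training.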
