International Conference on Neural Information Processing

Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks

Abstract

A low-precision deep neural network training technique for producing sparse, ternary neural networks is presented. The technique incorporates hardware implementation costs during training to achieve significant model compression for inference. Training involves three stages: network training with L2 regularization and a quantization threshold regularizer, quantization pruning, and finally retraining. On the MNIST and CIFAR10 datasets, the resulting networks achieve improved accuracy, a reduced memory footprint, and lower computational complexity compared with conventional methods. Our networks are up to 98% sparse and 5 and 11 times smaller than equivalent binary and ternary models respectively, translating to significant resource and speed benefits for hardware implementations.
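To make the quantization-pruning stage more concrete, below is a minimal NumPy sketch of threshold-based ternarization: weights whose magnitude falls below a threshold are pruned to zero, and the rest are mapped to +1 or -1. The function names and the rule for choosing the threshold are illustrative assumptions for this sketch, not details taken from the paper.

```python
import numpy as np

def ternarize(weights, delta):
    """Quantize a weight tensor to {-1, 0, +1} using threshold delta.
    Weights with |w| <= delta are pruned to zero, which creates sparsity."""
    q = np.zeros_like(weights)
    q[weights > delta] = 1.0
    q[weights < -delta] = -1.0
    return q

def sparsity(q):
    """Fraction of zero-valued weights after quantization pruning."""
    return float(np.mean(q == 0.0))

# Hypothetical usage: the threshold choice below (a multiple of the mean
# absolute weight) is an assumption for illustration only.
rng = np.random.default_rng(0)
w = rng.normal(scale=0.05, size=(256, 128))
delta = 0.7 * np.abs(w).mean()
w_q = ternarize(w, delta)
print(f"sparsity: {sparsity(w_q):.2%}")
```

In a pipeline of the kind the abstract describes, a step like this would sit between the regularized training stage and the final retraining stage, with the sparsity level governed by how the threshold interacts with the regularizers applied during training.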