
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks

International Conference on Neural Information Processing


Abstract

A low precision deep neural network training technique for producing sparse, ternary neural networks is presented. The technique incorporates hardware implementation costs during training to achieve significant model compression for inference. Training involves three stages: network training using L2 regularization and a quantization threshold regularizer, quantization pruning, and finally retraining. Resulting networks achieve improved accuracy, reduced memory footprint and reduced computational complexity compared with conventional methods on the MNIST and CIFAR10 datasets. Our networks are up to 98% sparse, and 5 and 11 times smaller than equivalent binary and ternary models respectively, translating to significant resource and speed benefits for hardware implementations.
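The abstract outlines a three-stage pipeline: regularized full-precision training, threshold-based ternary quantization with pruning, and retraining of the surviving weights. The sketch below illustrates that flow on a toy least-squares problem; the 0.7·mean|w| threshold heuristic, the L1-style stand-in for the paper's quantization-threshold regularizer, and all function names are assumptions for illustration, not the authors' implementation.

# Minimal sketch of sparse ternary quantization with pruning and retraining.
# The threshold rule and the toy task are assumptions; the paper's exact
# regularizer and training setup are not specified in the abstract.

import numpy as np

rng = np.random.default_rng(0)

def ternarize(w, delta):
    """Map weights to {-1, 0, +1} * scale using a symmetric threshold delta.
    Weights with |w| <= delta are pruned to exactly zero (the sparse part)."""
    t = np.zeros_like(w)
    t[w > delta] = 1.0
    t[w < -delta] = -1.0
    nonzero = np.abs(t) > 0
    # Per-tensor scale: mean magnitude of surviving weights (assumed convention).
    scale = np.abs(w[nonzero]).mean() if nonzero.any() else 0.0
    return t * scale, nonzero  # quantized weights and the sparsity mask

def loss_and_grad(w, X, y, l2=1e-3, sparsity=1e-3):
    """Toy least-squares loss with L2 weight decay plus an L1-style
    sparsity-inducing term standing in for the quantization-threshold
    regularizer described in the abstract."""
    err = X @ w - y
    loss = 0.5 * np.mean(err ** 2) + 0.5 * l2 * np.sum(w ** 2) + sparsity * np.sum(np.abs(w))
    grad = X.T @ err / len(y) + l2 * w + sparsity * np.sign(w)
    return loss, grad

# Toy data: the target depends on only a few of the 64 inputs, so a sparse solution exists.
X = rng.normal(size=(512, 64))
true_w = np.zeros(64)
true_w[:4] = [1.5, -1.5, 1.0, -1.0]
y = X @ true_w + 0.01 * rng.normal(size=512)

# Stage 1: train full-precision weights with the sparsity-inducing regularizers.
w = rng.normal(scale=0.1, size=64)
for _ in range(2000):
    _, g = loss_and_grad(w, X, y)
    w -= 0.05 * g

# Stage 2: quantization pruning -- ternarize and fix the zero pattern.
delta = 0.7 * np.abs(w).mean()          # assumed threshold heuristic
w_q, mask = ternarize(w, delta)

# Stage 3: retrain only the surviving weights (zeros stay pruned), then re-ternarize.
for _ in range(500):
    _, g = loss_and_grad(w, X, y, sparsity=0.0)
    w -= 0.05 * (g * mask)
w_q, mask = ternarize(w * mask, delta)

print(f"sparsity: {100 * (1 - mask.mean()):.1f}%  unique weight levels: {np.unique(w_q).size}")

Fixing the zero pattern before retraining is what makes the quantization step act as pruning: the retrained network keeps its sparsity while the remaining ternary weights recover accuracy, consistent with the three-stage procedure described above.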
