Journal of VLSI signal processing systems

Compression of Deep Neural Networks with Structured Sparse Ternary Coding



Abstract

Deep neural networks (DNNs) contain a large number of weights and usually require many off-chip memory accesses for inference. Weight compression is a major requirement for on-chip-memory-based implementations of DNNs, as it not only increases inference speed but also reduces power consumption. We propose a weight compression method for deep neural networks that combines pruning and quantization. The proposed method allows weights to take the values +1 or -1 only at predetermined positions. A look-up table then stores all possible combinations of sub-vectors of the weight matrices, so structured sparse weights can be encoded and decoded easily with the table. This method not only allows multiplication-free DNN implementations but also compresses weight storage by as much as 32x compared with floating-point networks, with only a small performance loss. Weight distribution normalization and gradual pruning techniques are applied to reduce the performance degradation. Experiments are conducted with fully connected DNNs and convolutional neural networks.
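The abstract describes the coding pipeline only at a high level. The following minimal NumPy sketch illustrates the general idea of restricting weight sub-vectors to a small ternary codebook and storing one look-up-table index per sub-vector. The sub-vector length K, the at-most-one-nonzero constraint, and the nearest-neighbour assignment are illustrative assumptions, not the paper's exact scheme (which constrains weights during training and applies distribution normalization and gradual pruning).

```python
# Minimal sketch of structured sparse ternary coding (assumptions: K = 4,
# at most S = 1 nonzero +1/-1 entry per sub-vector, nearest-neighbour encoding).
import itertools
import numpy as np

K, S = 4, 1  # sub-vector length and max nonzeros per sub-vector (assumed)

def build_lut(k=K, s=S):
    """Enumerate every ternary sub-vector with at most s nonzero entries."""
    entries = [codes for codes in itertools.product((-1, 0, 1), repeat=k)
               if sum(c != 0 for c in codes) <= s]
    return np.array(entries, dtype=np.int8)           # shape: (num_entries, k)

LUT = build_lut()

def encode(weights):
    """Map each length-K sub-vector of a weight matrix to a LUT index."""
    w = weights.reshape(-1, K)                         # split into sub-vectors
    # Nearest LUT entry in Euclidean distance; the paper instead constrains
    # the weights to the allowed pattern during training.
    dists = np.linalg.norm(w[:, None, :] - LUT[None, :, :], axis=2)
    return dists.argmin(axis=1).astype(np.uint8)       # one small index per sub-vector

def decode(indices, shape):
    """Reconstruct the ternary weight matrix from LUT indices."""
    return LUT[indices].reshape(shape).astype(np.float32)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal((8, 8)).astype(np.float32)
    idx = encode(w)
    w_hat = decode(idx, w.shape)
    # Storage drops from 32 bits per weight to one small index per K weights.
    print("original bits:", w.size * 32, "encoded bits:", idx.size * 8)
```

Decoding is a pure table lookup, and because the reconstructed weights are only -1, 0, or +1, the dot products in inference reduce to additions and subtractions, which is the multiplication-free property mentioned in the abstract.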
