Neurocomputing

Sparse low rank factorization for deep neural network compression

Abstract

Storing and processing millions of parameters in deep neural networks is highly challenging when deploying models in real-time applications on resource-constrained devices. The popular low-rank approximation approach, singular value decomposition (SVD), is generally applied to the weights of fully connected layers, where compact storage is achieved by keeping only the most prominent components of the decomposed matrices. Years of research on pruning-based neural network model compression have revealed that the relative importance or contribution of neurons within a layer varies considerably from one neuron to another. Recently, synapse pruning has also demonstrated that sparse matrices in the network architecture yield lower storage requirements and faster computation at inference time. We extend these arguments by proposing that the low-rank decomposition of weight matrices should also account for the significance of both the input and the output neurons of a layer. Combining the idea of sparsity with the unequal contributions of neurons towards the target, we propose the sparse low rank (SLR) method, which sparsifies the SVD matrices to achieve a better compression rate by keeping a lower rank for unimportant neurons. We demonstrate the effectiveness of our method in compressing well-known convolutional neural network based image recognition frameworks trained on popular datasets. Experimental results show that the proposed SLR approach outperforms vanilla truncated SVD and a pruning baseline, achieving better compression rates with minimal or no loss in accuracy. Code for the proposed approach is available at https://github.com/sridarah/slr. (C) 2020 Elsevier B.V. All rights reserved.
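
The following is a minimal sketch of the idea the abstract describes, not the authors' implementation (their code is at the GitHub link above). It assumes NumPy and externally supplied per-neuron importance masks (the function name `sparse_low_rank`, the `rank_high`/`rank_low` parameters, and the random importance masks are illustrative assumptions): a fully connected weight matrix is factorized with truncated SVD, and the factor matrices are then sparsified so that unimportant input/output neurons retain fewer components than important ones.

```python
# Hedged sketch of sparse low-rank factorization of a dense layer weight matrix.
# Assumptions: importance masks are given; the paper derives them from neuron
# contributions, here they are arbitrary booleans for illustration.
import numpy as np

def sparse_low_rank(W, input_importance, output_importance,
                    rank_high=32, rank_low=8):
    """Factorize W (out_dim x in_dim) into sparse low-rank factors U_s, Vt_s."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    U_r = U[:, :rank_high] * s[:rank_high]   # absorb singular values into U
    Vt_r = Vt[:rank_high, :]

    # Unimportant output neurons (rows of U) keep only the first rank_low components.
    U_s = U_r.copy()
    U_s[~output_importance, rank_low:] = 0.0
    # Unimportant input neurons (columns of V^T) likewise keep only rank_low components.
    Vt_s = Vt_r.copy()
    Vt_s[rank_low:, ~input_importance] = 0.0

    return U_s, Vt_s   # stored as sparse factors; W is approximated by U_s @ Vt_s

# Usage example with random data and arbitrary importance masks.
W = np.random.randn(512, 1024).astype(np.float32)
out_imp = np.random.rand(512) > 0.5
in_imp = np.random.rand(1024) > 0.5
U_s, Vt_s = sparse_low_rank(W, in_imp, out_imp)
err = np.linalg.norm(W - U_s @ Vt_s) / np.linalg.norm(W)
print(f"relative reconstruction error: {err:.3f}")
```

Compared with plain truncated SVD at the same nominal rank, the zeroed entries make the stored factors sparser, which is the source of the additional compression the abstract refers to.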
