ETRI Journal

Deep compression of convolutional neural networks with low‐rank approximation



Abstract

The application of deep neural networks (DNNs) to connect the world with cyber-physical systems (CPSs) has attracted much attention. However, DNNs require a large amount of memory and incur a high computational cost, which hinders their use on the relatively low-end smart devices that are widely deployed in CPSs. In this paper, we aim to determine whether DNNs can be efficiently deployed and operated on low-end smart devices. To do this, we develop a method that reduces the memory requirement of DNNs and increases their inference speed while keeping performance (for example, accuracy) close to the original level. The parameters of DNNs are decomposed using a hybrid canonical polyadic–singular value decomposition, approximated using a tensor power method, and fine-tuned with iterative one-shot hybrid fine-tuning to recover the accuracy lost during decomposition. We evaluate our method on frequently used networks and present results from extensive experiments on the effects of several fine-tuning methods, the importance of iterative fine-tuning, and the choice of decomposition technique. We demonstrate the effectiveness of the proposed method by deploying the compressed networks on smartphones.
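The core idea the abstract describes, replacing full-rank weights with a low-rank factorization and then fine-tuning to recover accuracy, can be illustrated with a minimal sketch. The example below applies plain truncated SVD to a fully connected layer in PyTorch; it is a simplified illustration only, not the paper's CP-SVD hybrid or tensor power method, and the helper name compress_linear is hypothetical.

    # Minimal sketch of low-rank compression via truncated SVD.
    # Illustrative only: the paper uses a hybrid CP-SVD decomposition
    # with a tensor power method, not plain SVD on a linear layer.
    import torch
    import torch.nn as nn

    def compress_linear(layer: nn.Linear, rank: int) -> nn.Sequential:
        """Approximate W (out x in) by a rank-r product, replacing one
        dense layer with two smaller ones: rank*(in+out) parameters
        instead of in*out."""
        W = layer.weight.data                         # (out_features, in_features)
        U, S, Vh = torch.linalg.svd(W, full_matrices=False)
        U_r = U[:, :rank] * S[:rank]                  # (out, rank), singular values folded in
        V_r = Vh[:rank, :]                            # (rank, in)

        first = nn.Linear(layer.in_features, rank, bias=False)
        second = nn.Linear(rank, layer.out_features, bias=layer.bias is not None)
        first.weight.data = V_r
        second.weight.data = U_r
        if layer.bias is not None:
            second.bias.data = layer.bias.data
        return nn.Sequential(first, second)

    # Usage: compress a 1024x1024 layer to rank 64, then fine-tune
    # the factorized network to recover the accuracy drop.
    layer = nn.Linear(1024, 1024)
    compressed = compress_linear(layer, rank=64)
    x = torch.randn(8, 1024)
    print((layer(x) - compressed(x)).abs().max())     # approximation error

With in = out = 1024 and rank = 64, the factorized pair stores 2 x 64 x 1024 = 131,072 weights instead of 1,048,576, roughly an 8x reduction; the accuracy loss such a truncation introduces is what the paper's iterative one-shot hybrid fine-tuning is designed to recover.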


