
Learning Compact Recurrent Neural Networks with Block-Term Tensor Decomposition


Abstract

Recurrent Neural Networks (RNNs) are powerful sequence modeling tools. However, when dealing with high-dimensional inputs, training RNNs becomes computationally expensive due to the large number of model parameters. This hinders RNNs from solving many important computer vision tasks, such as Action Recognition in Videos and Image Captioning. To overcome this problem, we propose a compact and flexible structure, namely Block-Term tensor decomposition, which greatly reduces the parameters of RNNs and improves their training efficiency. Compared with alternative low-rank approximations, such as the Tensor-Train RNN (TT-RNN), our method, the Block-Term RNN (BT-RNN), is not only more concise (when using the same rank), but also able to attain a better approximation to the original RNNs with far fewer parameters. On three challenging tasks, including Action Recognition in Videos, Image Captioning, and Image Generation, BT-RNN outperforms TT-RNN and the standard RNN in terms of both prediction accuracy and convergence rate. Specifically, BT-LSTM uses 17,388 times fewer parameters than the standard LSTM while achieving an accuracy improvement of over 15.6% on the Action Recognition task on the UCF11 dataset.
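To make the parameter saving described in the abstract concrete, the following is a minimal NumPy sketch of the Block-Term format: a dense weight matrix is folded into a higher-order tensor and approximated by a sum of N Tucker terms, each consisting of a small core tensor plus one factor matrix per mode. The shapes, ranks, and folding scheme here are illustrative assumptions, not the paper's exact BT-RNN construction.

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical folding: a dense W of shape (64, 256) becomes a 4-way
    # tensor of shape (I1, I2, J1, J2). These dimensions are assumptions
    # chosen for illustration.
    I1, I2, J1, J2 = 8, 8, 16, 16
    dims = (I1, I2, J1, J2)
    N, R = 2, 4  # N Tucker blocks, Tucker rank R on every mode

    # Block-Term parameters: per block, one core (R^4 entries) and one
    # factor matrix per mode (dims[k] x R).
    cores = [rng.standard_normal((R, R, R, R)) for _ in range(N)]
    factors = [[rng.standard_normal((d, R)) for d in dims] for _ in range(N)]

    def bt_to_matrix(cores, factors):
        """Reconstruct the dense weight matrix from Block-Term factors:
        the sum over blocks of a Tucker reconstruction (core contracted
        with one factor matrix per mode), reshaped back to a matrix."""
        T = sum(
            np.einsum("abcd,ia,jb,kc,ld->ijkl", G, *As)
            for G, As in zip(cores, factors)
        )
        return T.reshape(I1 * I2, J1 * J2)

    W = bt_to_matrix(cores, factors)

    dense_params = (I1 * I2) * (J1 * J2)
    bt_params = N * (R ** 4 + sum(d * R for d in dims))
    print(f"dense: {dense_params} params, BT: {bt_params} params "
          f"({dense_params / bt_params:.1f}x fewer)")  # ~18x fewer here

The storage cost scales as N(R^d + R * sum of mode sizes) instead of the product of all mode sizes, so the compression ratio grows rapidly with the weight matrix's dimensions; the 17,388x figure reported in the abstract comes from applying this principle to the much larger input-to-hidden weights of an LSTM, not from the toy sizes above.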
