Annual Conference on Neural Information Processing Systems

Hessian-free Optimization for Learning Deep Multidimensional Recurrent Neural Networks



Abstract

Multidimensional recurrent neural networks (MDRNNs) have shown remarkable performance in speech and handwriting recognition. The performance of an MDRNN is improved by further increasing its depth, and the difficulty of learning the deeper network is overcome by using Hessian-free (HF) optimization. Given that connectionist temporal classification (CTC) is utilized as the objective for learning an MDRNN for sequence labeling, the non-convexity of CTC poses a problem when applying HF to the network. As a solution, a convex approximation of CTC is formulated, and its relationship with the EM algorithm and the Fisher information matrix is discussed. An MDRNN up to a depth of 15 layers is successfully trained using HF, resulting in improved performance for sequence labeling.
