Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition

Abstract

Deep neural network models have achieved considerable success in a wide range of fields. Several architectures have been proposed to alleviate the vanishing gradient problem, and hence enable training of very deep networks. In the speech recognition area, convolutional neural networks, recurrent neural networks, and fully connected deep neural networks have been shown to be complementary in their modeling capabilities. Combining all three components, called CLDNN, yields the best performance to date. In this paper, we extend the CLDNN model by introducing a highway connection between LSTM layers, which enables direct information flow from cells of lower layers to cells of upper layers. With this design, we are able to better exploit the advantages of a deeper structure. Experiments on the GALE Chinese Broadcast Conversation/News Speech dataset indicate that our model outperforms all previous models and achieves a new benchmark, which is 22.41% character error rate on the dataset.
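
The abstract does not give the equations for the highway connection, so the following is only a minimal sketch of one way such a connection could look: a carry gate that adds the lower layer's cell state into the upper layer's cell-state update. The function name `highway_cell_update`, the argument names, and the choice to pass gate activations in directly (rather than computing them from weights) are illustrative assumptions, not the authors' implementation.

```python
def highway_cell_update(c_lower, c_prev, i_gate, f_gate, d_gate, g_cand):
    """Elementwise cell-state update for one time step of an upper LSTM layer.

    Hypothetical sketch (assumed formulation, not taken from the paper):
        c_new = d_gate * c_lower + f_gate * c_prev + i_gate * g_cand

    c_lower: cell state of the layer below at the current time step
    c_prev:  this layer's own cell state at the previous time step
    i_gate, f_gate, d_gate: input, forget, and carry gate activations in [0, 1]
    g_cand:  candidate cell values in [-1, 1]
    """
    return [
        d * cl + f * cp + i * g
        for cl, cp, i, f, d, g in zip(
            c_lower, c_prev, i_gate, f_gate, d_gate, g_cand
        )
    ]


# With the carry gate fully open and the other gates closed,
# the lower layer's cell state passes straight through.
passthrough = highway_cell_update(
    c_lower=[1.0, -2.0], c_prev=[0.5, 0.5],
    i_gate=[0.0, 0.0], f_gate=[0.0, 0.0], d_gate=[1.0, 1.0],
    g_cand=[0.3, 0.3],
)

# With the carry gate closed, the update reduces to the usual LSTM form.
standard = highway_cell_update(
    c_lower=[3.0], c_prev=[2.0],
    i_gate=[0.5], f_gate=[0.5], d_gate=[0.0],
    g_cand=[1.0],
)
```

Under this assumed form, the carry gate gives the lower layer's cell contents a direct additive path upward, which is the kind of "direct information flow from cells of lower layers to cells of upper layers" the abstract describes.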

Bibliographic details

Source
Annual Conference of the International Speech Communication Association | 2016 | 744p | 5 pages
Authors
Wei-Ning Hsu; Yu Zhang; Ann Lee; James Glass
Original format: PDF
Chinese Library Classification: TB95-53
Date indexed: 2022-08-21 11:41:05

Similar literature

1. Speech Emotion Recognition Using Deep Convolutional Neural Network and Simple Recurrent Unit [J]. Pengxu Jiang, Hongliang Fu, Huawei Tao. Engineering Letters. 2019, No. 4
2. Learning Deep Binaural Representations With Deep Convolutional Neural Networks for Spontaneous Speech Emotion Recognition [J]. Zhang Shiqing, Chen Aihua, Guo Wenping, Quality Control, Transactions. 2020
3. 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition [J]. Mingyi Chen, Xuanji He, Jing Yang. IEEE Signal Processing Letters. 2018, No. 10
4. Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition [C]. Wei-Ning Hsu, Yu Zhang, Ann Lee. Annual Conference of the International Speech Communication Association. 2016
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D]. Hassan, Abdalraouf. 2018
6. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition [O]. Francisco Javier Ordóñez, Daniel Roggen. 2016
7. Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition [O]. Lu, Liang, Renals, Steve. 2016
