首页> 外文会议>Annual Conference of the International Speech Communication Association >Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition
【24h】

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition

机译:利用深度和高速公路连接在卷积经常性深神经网络中的语音识别

获取原文

摘要

Deep neural network models have achieved considerable success in a wide range of fields. Several architectures have been proposed to alleviate the vanishing gradient problem, and hence enable training of very deep networks. In the speech recognition area, convolutional neural networks, recurrent neural networks, and fully connected deep neural networks have been shown to be complimentary in their modeling capabilities. Combining all three components, called CLDNN, yields the best performance to date. In this paper, we extend the CLDNN model by introducing a highway connection between LSTM layers, which enables direct information flow from cells of lower layers to cells of upper layers. With this design, we are able to better exploit the advantages of a deeper structure. Experiments on the GALE Chinese Broadcast Conversation/News Speech dataset indicate that our model outperforms all previous models and achieves a new benchmark, which is 22.41% character error rate on the dataset.
机译:深度神经网络模型在各种领域取得了相当大的成功。 已经提出了几种架构来缓解消失的渐变问题,因此能够培训非常深的网络。 在语音识别区域中,卷积神经网络,经常性神经网络和完全连接的深神经网络已被证明在其建模能力中是互补的。 结合所有三个组件,称为CLDNN,会产生最佳性能。 在本文中,我们通过在LSTM层之间引入高速公路连接来扩展CLDNN模型,这使得能够从下层的电池流到上层的电池。 通过这种设计,我们能够更好地利用更深层次的结构的优势。 巨大的巨大播放对话/新闻语音数据集的实验表明我们的模型优于所有以前的模型,并实现了新的基准,在数据集中是22.41%的字符错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号