首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >On training the recurrent neural network encoder-decoder for large vocabulary end-to-end speech recognition
【24h】

On training the recurrent neural network encoder-decoder for large vocabulary end-to-end speech recognition

机译:训练递归神经网络编解码器进行大词汇量端到端语音识别

获取原文

摘要

Recently, there has been an increasing interest in end-to-end speech recognition using neural networks, with no reliance on hidden Markov models (HMMs) for sequence modelling as in the standard hybrid framework. The recurrent neural network (RNN) encoderdecoder is such a model, performing sequence to sequence mapping without any predefined alignment. This model first transforms the input sequence into a fixed length vector representation, from which the decoder recovers the output sequence. In this paper, we extend our previous work on this model for large vocabulary end-to-end speech recognition. We first present a more effective stochastic gradient decent (SGD) learning rate schedule that can significantly improve the recognition accuracy. We then extend the decoder with long memory by introducing another recurrent layer that performs implicit language modelling. Finally, we demonstrate that using multiple recurrent layers in the encoder can reduce the word error rate. Our experiments were carried out on the Switchboard corpus using a training set of around 300 hours of transcribed audio data, and we have achieved significantly higher recognition accuracy, thereby reduced the gap compared to the hybrid baseline.
机译:最近,人们越来越关注使用神经网络的端到端语音识别,并且不像标准的混合框架那样依赖于隐马尔可夫模型(HMM)进行序列建模。递归神经网络(RNN)编码器/解码器就是这种模型,无需任何预定义的对齐即可执行序列到序列的映射。该模型首先将输入序列转换为固定长度的向量表示形式,解码器从中恢复输出序列。在本文中,我们将先前在此模型上的工作扩展到大词汇量端到端语音识别。我们首先提出一种更有效的随机梯度体面(SGD)学习速率计划,该计划可以显着提高识别精度。然后,我们通过引入执行隐式语言建模的另一个递归层来扩展具有较长内存的解码器。最后,我们证明了在编码器中使用多个循环层可以减少字错误率。我们的实验是在Switchboard语料库上使用大约300个小时的转录音频数据进行训练的,并且我们已经实现了明显更高的识别准确度,从而与混合基线相比减少了差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号