首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition
【24h】

Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition

机译:基于记忆控制的语音序列语音识别语言模型集成

获取原文

摘要

In this paper, we explore several new schemes to train a seq2seq model to integrate a pre-trained language model (LM). Our proposed fusion methods focus on the memory cell state and the hidden state in the seq2seq decoder long short-term memory (LSTM), and the memory cell state is updated by the LM unlike the prior studies. This means the memory retained by the main seq2seq would be adjusted by the external LM. These fusion methods have several variants depending on the architecture of this memory cell update and the use of memory cell and hidden states which directly affects the final label inference. We performed the experiments to show the effectiveness of the proposed methods in a mono-lingual ASR setup on the Librispeech corpus and in a transfer learning setup from a multilingual ASR (MLASR) base model to a low-resourced language. In Librispeech, our best model improved WER by 3.7%, 2.4% for test clean, test other relatively to the shallow fusion baseline, with multilevel decoding. In transfer learning from an MLASR base model to the IARPA Babel Swahili model, the best scheme improved the transferred model on eval set by 9.9%, 9.8% in CER, WER relatively to the 2-stage transfer baseline.
机译:在本文中,我们探索了几种新的方案来训练seq2seq模型以集成预训练的语言模型(LM)。我们提出的融合方法着眼于seq2seq解码器长短期存储器(LSTM)中的存储单元状态和隐藏状态,并且与现有技术不同,存储单元状态由LM更新。这意味着由主seq2seq保留的内存将由外部LM调整。这些融合方法有几种变体,具体取决于此存储单元更新的体系结构以及对存储单元的使用以及直接影响最终标签推断的隐藏状态。我们进行了实验,以表明所提出的方法在Librispeech语料库上的单语言ASR设置以及从多语言ASR(MLASR)基本模型到低资源语言的转移学习设置中的有效性。在Librispeech中,我们最好的模型将WER提高了3.7%,将测试清除率提高了2.4%,并使用浅层融合基线对其他测试进行了多级解码。在从MLASR基本模型到IARPA Babel Swahili模型的转移学习中,相对于两阶段转移基线,最佳方案将CER,WER的评估集转移模型提高了9.9%,9.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号