首页> 外国专利> Method and system for training language models to reduce recognition errors

Method and system for training language models to reduce recognition errors

机译:训练语言模型以减少识别错误的方法和系统

摘要

A method and for training a language model to reduce recognition errors, wherein the language model is a recurrent neural network language model (RNNLM) by first acquiring training samples. An automatic speech recognition system (ASR) is applied to the training samples to produce recognized words and probabilites of the recognized words, and an N-best list is selected from the recognized words based on the probabilities. determining word errors using reference data for hypotheses in the N-best list. The hypotheses are rescored using the RNNLM. Then, we determine gradients for the hypotheses using the word errors and gradients for words in the hypotheses. Lastly, parameters of the RNNLM are updated using a sum of the gradients.
机译:一种用于训练语言模型以减少识别错误的方法和方法,其中,通过首先获取训练样本,该语言模型是递归神经网络语言模型(RNNLM)。将自动语音识别系统(ASR)应用于训练样本以生成识别的单词和识别单词的概率,并基于概率从识别的单词中选择N个最佳列表。使用N最佳列表中的假设的参考数据确定单词错误。使用RNNLM对这些假设进行重新评估。然后,我们使用单词错误和假设中单词的梯度确定假设的梯度。最后,使用梯度之和更新RNNLM的参数。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号