首页> 外国专利> LARGE MARGIN TRACKING FOR ATTENTION-BASED END-TO-END SPEECH RECOGNITION

LARGE MARGIN TRACKING FOR ATTENTION-BASED END-TO-END SPEECH RECOGNITION

机译:大幅度跟踪,用于基于注意力的端到端语音识别

摘要

A method of attention-based end-to-end (E2E) automatic speech recognition (ASR) training, includes performing cross-entropy training of a model, based on one or more input features of a speech signal, performing beam searching of the model of which the cross-entropy training is performed, to generate an n-best hypotheses list of output hypotheses, and determining a one-best hypothesis among the generated n-best hypotheses list. The method further includes determining a character-based gradient and a word-based gradient, based on the model of which the cross-entropy training is performed and a loss function in which a distance between a reference sequence and the determined one-best hypothesis is maximized, and performing backpropagation of the determined character-based gradient and the determined word-based gradient to the model, to update the model.
机译:一种基于注意力的端到端(E2E)自动语音识别(ASR)训练的方法,包括基于语音信号的一个或多个输入特征执行模型的交叉熵训练,执行模型的波束搜索其中执行交叉熵训练,以生成输出假设的n个最佳假设列表,并在生成的n个最佳假设列表中确定一个最佳假设。该方法进一步包括基于执行交叉熵训练的模型以及其中参考序列与所确定的一个最佳假设之间的距离为的损失函数来确定基于字符的梯度和基于单词的梯度。最大化,并对确定的基于字符的坡度和确定的基于单词的坡度进行反向传播到模型,以更新模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号