首页> 外国专利> MINIMUM WORD ERROR RATE TRAINING FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS

MINIMUM WORD ERROR RATE TRAINING FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS

机译:基于注意力的序列到序列模型的最小单词错误率训练

摘要

Methods, systems, and apparatus, including computer programs encoded on computer-readable storage media, for speech recognition using attention-based sequence-to-sequence models. In some implementations, audio data indicating acoustic characteristics of an utterance is received. A sequence of feature vectors indicative of the acoustic characteristics of the utterance is generated. The sequence of feature vectors is processed using a speech recognition model that has been trained using a loss function that uses N-best lists of decoded hypotheses, the speech recognition model including an encoder, an attention module, and a decoder. The encoder and decoder each include one or more recurrent neural network layers. A sequence of output vectors representing distributions over a predetermined set of linguistic units is obtained. A transcription for the utterance is obtained based on the sequence of output vectors. Data indicating the transcription of the utterance is provided.
机译:方法,系统和装置,包括编码在计算机可读存储介质上的计算机程序,用于使用基于注意力的序列到序列模型进行语音识别。在一些实施方式中,接收指示话语的声学特性的音频数据。产生指示话音的声学特性的一系列特征向量。使用已经使用损失函数训练的语音识别模型来处理特征向量的序列,该损失函数使用解码假设的N个最佳列表,该语音识别模型包括编码器,关注模块和解码器。编码器和解码器各自包括一个或多个递归神经网络层。获得表示在预定语言单元集合上的分布的输出矢量序列。基于输出向量的序列获得话语的转录。提供指示话语转录的数据。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号