首页> 外国专利> MINIMUM WORD ERROR RATE TRAINING FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS

MINIMUM WORD ERROR RATE TRAINING FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS

机译：基于注意力的序列到序列模型的最小单词错误率训练

页面导航

摘要
著录项
相似文献

摘要

Methods, systems, and apparatus, including computer programs encoded on computer-readable storage media, for speech recognition using attention-based sequence-to-sequence models. In some implementations, audio data indicating acoustic characteristics of an utterance is received. A sequence of feature vectors indicative of the acoustic characteristics of the utterance is generated. The sequence of feature vectors is processed using a speech recognition model that has been trained using a loss function that uses N-best lists of decoded hypotheses, the speech recognition model including an encoder, an attention module, and a decoder. The encoder and decoder each include one or more recurrent neural network layers. A sequence of output vectors representing distributions over a predetermined set of linguistic units is obtained. A transcription for the utterance is obtained based on the sequence of output vectors. Data indicating the transcription of the utterance is provided.

机译：方法，系统和装置，包括编码在计算机可读存储介质上的计算机程序，用于使用基于注意力的序列到序列模型进行语音识别。在一些实施方式中，接收指示话语的声学特性的音频数据。产生指示话音的声学特性的一系列特征向量。使用已经使用损失函数训练的语音识别模型来处理特征向量的序列，该损失函数使用解码假设的N个最佳列表，该语音识别模型包括编码器，关注模块和解码器。编码器和解码器各自包括一个或多个递归神经网络层。获得表示在预定语言单元集合上的分布的输出矢量序列。基于输出向量的序列获得话语的转录。提供指示话语转录的数据。

著录项

公开/公告号US2020043483A1

专利类型
公开/公告日2020-02-06

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号US201916529252
发明设计人 ROHIT PRAKASH PRABHAVALKAR;TARA N. SAINATH;YONGHUI WU;PATRICK AN PHU NGUYEN;ZHIFENG CHEN;CHUNG-CHENG CHIU;ANJULI PATRICIA KANNAN;
展开▼

申请日2019-08-01
分类号G10L15/197;G10L15/16;G10L15/22;G10L15/06;G10L15/02;
国家 US
入库时间 2022-08-21 11:18:56

相似文献

专利
外文文献
中文文献