
Input-feeding architecture for attention based end-to-end speech recognition


Abstract

Methods and apparatuses are provided for end-to-end speech recognition training performed by at least one processor. The method includes: receiving one or more input speech frames; generating a sequence of encoder hidden states by transforming the input speech frames; computing attention weights based on each of the encoder hidden states and a current decoder hidden state; performing a decoding operation based on previous embedded label prediction information and previous attentional hidden state information generated from the attention weights; and generating current embedded label prediction information based on a result of the decoding operation and the attention weights. Each step is performed by the at least one processor.
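The decoding loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify the recurrent cell, the attention scoring function, or any dimensions, so the stateless tanh cell, the dot-product scoring, the random weights, and every name below (`decode_step`, `params`, `W_dec`, `W_att`, `W_out`, `embed`) are assumptions chosen only to show how the previous attentional hidden state is fed back into the decoder input.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rows, cols, rng):
    """Small random weights; stands in for trained parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def decode_step(enc_states, prev_label_emb, prev_att_hidden, params):
    # Input feeding: the decoder input concatenates the previous embedded
    # label prediction with the previous attentional hidden state.
    dec_in = prev_label_emb + prev_att_hidden
    # Assumption: a single stateless tanh layer stands in for the decoder cell.
    dec_hidden = [math.tanh(v) for v in matvec(params["W_dec"], dec_in)]
    # Assumption: dot-product scoring between each encoder hidden state
    # and the current decoder hidden state.
    scores = [sum(h * d for h, d in zip(enc, dec_hidden)) for enc in enc_states]
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the encoder hidden states.
    dim = len(enc_states[0])
    context = [sum(w * enc[i] for w, enc in zip(weights, enc_states))
               for i in range(dim)]
    # Attentional hidden state combines the context and the decoder state.
    att_hidden = [math.tanh(v)
                  for v in matvec(params["W_att"], context + dec_hidden)]
    # Current label prediction from the attentional hidden state.
    probs = softmax(matvec(params["W_out"], att_hidden))
    label = max(range(len(probs)), key=probs.__getitem__)
    return label, weights, att_hidden


# Hypothetical sizes: hidden size H, label embedding size E,
# vocabulary size V, number of encoder states T.
rng = random.Random(0)
H, E, V, T = 4, 3, 5, 6
params = {
    "W_dec": rand_matrix(H, E + H, rng),
    "W_att": rand_matrix(H, 2 * H, rng),
    "W_out": rand_matrix(V, H, rng),
    "embed": rand_matrix(V, E, rng),
}
enc_states = rand_matrix(T, H, rng)  # stand-in for encoder hidden states

label_emb = params["embed"][0]       # assume label 0 is the start symbol
att_hidden = [0.0] * H               # no attentional state before step 1
labels = []
for _ in range(3):
    label, weights, att_hidden = decode_step(
        enc_states, label_emb, att_hidden, params)
    label_emb = params["embed"][label]
    labels.append(label)
```

Feeding `att_hidden` back into the next step gives the decoder access to what it attended to previously, which is the dependency the abstract describes between the decoding operation and the previous attentional hidden state information.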
