Input-feeding architecture for attention based end-to-end speech recognition

Abstract

Methods and apparatuses are provided for end-to-end speech recognition training performed by at least one processor. The method includes: receiving, by the at least one processor, one or more input speech frames; generating, by the at least one processor, a sequence of encoder hidden states by transforming the input speech frames; computing, by the at least one processor, attention weights based on each of the sequence of encoder hidden states and a current decoder hidden state; performing, by the at least one processor, a decoding operation based on previous embedded label prediction information and previous attentional hidden state information generated based on the attention weights; and generating current embedded label prediction information based on a result of the decoding operation and the attention weights.
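The steps listed in the abstract can be sketched as a single decoder step in NumPy. This is only an illustrative forward pass, not the patented implementation. The plain tanh recurrent cell, the projected dot-product scoring, greedy label selection, and all names (`init_params`, `decoder_step`, `greedy_decode`, `W_in`, `W_q`, and so on) are assumptions made for the example. The abstract does not specify the cell type, the scoring function, or the training loss, and the loss is omitted here.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def init_params(enc_dim, dec_dim, emb_dim, att_dim, vocab, seed=0):
    # Hypothetical parameter set; shapes are chosen only so the sketch runs.
    rng = np.random.default_rng(seed)
    w = lambda *shape: rng.normal(scale=0.1, size=shape)
    return {
        "E": w(vocab, emb_dim),                  # label embedding table
        "W_in": w(dec_dim, emb_dim + att_dim),   # input-feeding projection
        "W_rec": w(dec_dim, dec_dim),            # recurrent weights
        "W_q": w(enc_dim, dec_dim),              # maps decoder state to encoder space
        "W_att": w(att_dim, enc_dim + dec_dim),  # attentional hidden state layer
        "W_out": w(vocab, att_dim),              # label logits
    }

def decoder_step(p, enc_states, prev_label, prev_att, prev_state):
    # Input feeding: the previous label embedding is concatenated with the
    # previous attentional hidden state and fed to the decoder.
    x = np.concatenate([p["E"][prev_label], prev_att])
    state = np.tanh(p["W_in"] @ x + p["W_rec"] @ prev_state)

    # Attention weights from each encoder hidden state and the current
    # decoder hidden state (assumed dot-product scoring).
    weights = softmax(enc_states @ (p["W_q"] @ state))
    context = weights @ enc_states

    # Attentional hidden state combines the context vector and decoder state.
    att = np.tanh(p["W_att"] @ np.concatenate([context, state]))

    # Current label prediction (greedy choice, assumed for illustration).
    label = int(np.argmax(p["W_out"] @ att))
    return label, att, state, weights

def greedy_decode(p, enc_states, start_label=0, steps=5):
    # Start from zero vectors for the attentional and decoder states.
    att = np.zeros(p["W_att"].shape[0])
    state = np.zeros(p["W_rec"].shape[0])
    label, labels, all_weights = start_label, [], []
    for _ in range(steps):
        label, att, state, weights = decoder_step(p, enc_states, label, att, state)
        labels.append(label)
        all_weights.append(weights)
    return labels, np.stack(all_weights)
```

For example, with `enc_states` of shape `(7, 4)` standing in for seven encoded speech frames, `greedy_decode(init_params(4, 3, 2, 5, 6), enc_states, steps=3)` returns three label indices and a `(3, 7)` matrix of attention weights, one row per decoding step. Feeding `att` back into the next step is what distinguishes input feeding from a decoder that conditions only on the previous label.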

Bibliographic data

Publication number: US10672382B2

Publication date: 2020-06-02

Original format: PDF
Applicant/Assignee: TENCENT AMERICA LLC

Application number: US201816160352
Inventors: CHAO WENG; JIA CUI; GUANGSEN WANG; JUN WANG; CHENGZHU YU; DAN SU; DONG YU

Filing date: 2018-10-15
Classification: G10L15/06; G10L15/14; G10L15/183; G10L15/22
Country: US
Database entry time: 2022-08-21 11:28:08
