首页> 外国专利> Processing acoustic sequences using long short-term memory (LSTM) neural networks that include recurrent projection layers

Processing acoustic sequences using long short-term memory (LSTM) neural networks that include recurrent projection layers

机译：使用包含循环投影层的长短期记忆（LSTM）神经网络处理声音序列

页面导航

摘要
著录项
相似文献

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating phoneme representations of acoustic sequences using projection sequences. One of the methods includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps, processing the acoustic feature representation through each of one or more long short-term memory (LSTM) layers; and for each of the plurality of time steps, processing the recurrent projected output generated by the highest LSTM layer for the time step using an output layer to generate a set of scores for the time step.

机译：方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于使用投影序列生成声音序列的音素表示。该方法之一包括：在多个时间步长的每一个上接收声音序列，该声音序列表示话语，并且该声音序列包括相应的声音特征表示。对于多个时间步长中的每一个，通过一个或多个长短期记忆（LSTM）层中的每一层处理声学特征表示;对于多个时间步长中的每一个，使用输出层为该时间步长处理由最高LSTM层生成的递归投影输出，以生成该时间步长的一组分数。

著录项

公开/公告号US10026397B2

专利类型
公开/公告日2018-07-17

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号US201715454407
发明设计人 HASIM SAK;ANDREW W. SENIOR;
展开▼

申请日2017-03-09
分类号G10L15/16;G10L15/02;G10L15/14;
国家 US
入库时间 2022-08-21 13:05:44

相似文献

专利
外文文献
中文文献