首页> 外国专利> VERY DEEP CONVOLUTIONAL NEURAL NETWORKS FOR END-TO-END SPEECH RECOGNITION

VERY DEEP CONVOLUTIONAL NEURAL NETWORKS FOR END-TO-END SPEECH RECOGNITION

机译：用于端到端语音识别的非常深层的卷积神经网络

页面导航

摘要
著录项
相似文献

摘要

A speech recognition neural network system includes an encoder neural network and a decoder neural network. The encoder neural network generates an encoded sequence from an input acoustic sequence that represents an utterance. The input acoustic sequence includes a respective acoustic feature representation at each of a plurality of input time steps, the encoded sequence includes a respective encoded representation at each of a plurality of time reduced time steps, and the number of time reduced time steps is less than the number of input time steps. The encoder neural network includes a time reduction subnetwork, a convolutional LSTM subnetwork, and a network in network subnetwork. The decoder neural network receives the encoded sequence and processes the encoded sequence to generate, for each position in an output sequence order, a set of sub string scores that includes a respective sub string score for each substring in a set of substrings.

机译：语音识别神经网络系统包括编码器神经网络和解码器神经网络。编码器神经网络从代表话语的输入声学序列生成编码序列。输入声学序列在多个输入时间步长的每个步长包括相应的声学特征表示，编码序列在多个时间缩减时间步长的每个步阶包括相应的编码表示，并且时间缩减时间步长的数量小于输入时间步数。编码器神经网络包括时间减少子网，卷积LSTM子网和网络子网中的网络。解码器神经网络接收编码的序列并处理编码的序列，以按照输出序列的顺序为每个位置生成一组子字符串分数，该子字符串分数包括一组子字符串中每个子字符串的相应子字符串分数。

著录项

公开/公告号US2020090044A1

专利类型
公开/公告日2020-03-19

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号US201916692538
发明设计人 NAVDEEP JAITLY;YU ZHANG;WILLIAM CHAN;
展开▼

申请日2019-11-22
分类号G06N3/08;G06N3/04;G10L15/16;G10L15/02;G10L15/22;
国家 US
入库时间 2022-08-21 11:23:17

相似文献

专利
外文文献
中文文献