Speech recognition using convolutional neural networks

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.
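The core idea in the abstract, a stack of dilated convolutional layers that sees only the current audio input and the inputs preceding it, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the kernel size of 2, the fixed weights, the doubling dilation schedule, and the absence of nonlinearities or gating.

```python
from typing import List, Sequence


def causal_dilated_conv(x: Sequence[float], weights: Sequence[float],
                        dilation: int) -> List[float]:
    """1-D causal convolution (hypothetical helper).

    out[t] combines x[t], x[t - dilation], x[t - 2 * dilation], ...
    Positions before the start of the sequence count as zero, so no
    output ever depends on a later input.
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation  # look back only, never forward
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


def dilated_stack(x: Sequence[float], weights: Sequence[float],
                  dilations: Sequence[int]) -> List[float]:
    """Apply one causal layer per dilation; the same weights are reused
    in every layer purely to keep the sketch short (an assumption)."""
    h = list(x)
    for d in dilations:
        h = causal_dilated_conv(h, weights, d)
    return h


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of input steps that can influence a single output step."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Feed a single impulse and see which output positions it reaches.
audio = [0.0] * 16
audio[3] = 1.0
representation = dilated_stack(audio, weights=[0.5, 0.5], dilations=[1, 2, 4])
print(receptive_field(2, [1, 2, 4]))
print(representation)
```

With dilations 1, 2, 4 and a kernel of size 2, each output position can draw on 8 consecutive inputs ending at the current one, which is why doubling the dilation per layer widens the context quickly without adding many layers.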

Bibliographic details

Publication number: US11069345B2
Patent type:
Publication date: 2021-07-20
Original format: PDF
Applicant/Assignee: DEEPMIND TECHNOLOGIES LIMITED
Application number: US201916719424
Inventors: AARON GERARD ANTONIUS VAN DEN OORD; SANDER ETIENNE LEA DIELEMAN; NAL EMMERICH KALCHBRENNER; KAREN SIMONYAN; ORIOL VINYALS; LASSE ESPEHOLT
Filing date: 2019-12-18
Classification: G10L15/16; G06N3/04; G06N3/08; G10L15/02; G10L15/22
Country: US
Date added to database: 2022-08-24 20:01:00
