首页> 外国专利> Speech recognition using convolutional neural networks

Speech recognition using convolutional neural networks

机译:使用卷积神经网络的语音识别

摘要

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.
机译:方法,系统和设备,包括在计算机存储介质上编码的计算机程序,用于通过生成从音频数据输入序列的神经网络输出的神经网络来执行语音识别,其中神经网络输出表征音频数据输入序列中的单词。其中一个方法包括用于每个音频数据输入,提供包括音频数据输入的当前音频数据输入序列和在音频数据输入序列中的音频数据输入的音频数据输入到包括多个的卷积子网。扩张的卷积神经网络层,其中卷积子网被配置为,对于多个音频数据输入中的每一个:接收用于音频数据输入的当前音频数据输入序列,处理当前音频数据输入序列以生成替代音频数据输入的表示。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号