首页> 外国专利> SPEECH ENHANCEMENT METHOD BASED ON FULLY CONVOLUTIONAL NEURAL NETWORK, DEVICE, AND STORAGE MEDIUM

SPEECH ENHANCEMENT METHOD BASED ON FULLY CONVOLUTIONAL NEURAL NETWORK, DEVICE, AND STORAGE MEDIUM

机译:基于全卷积神经网络,设备和存储介质的语音增强方法

摘要

The present application relates to the field of artificial intelligence. Disclosed is a speech enhancement method based on a fully convolutional neural network. The method comprises: constructing a fully convolutional neural network model, the fully convolutional neural network comprising an input layer, a hidden layer, and an output layer; the hidden layer comprising a plurality of convolutional layers; each of the plurality of convolutional layers comprising a plurality of filters; training the fully convolutional neural network model; inputting an original speech signal into the trained fully convolutional neural network model; and outputting an enhanced speech signal. In the fully convolutional neural network model of the present application, a full connection layer is deleted, and only convolutional layers are comprised, so that parameters of the neural network are significantly reduced, the fully convolutional neural network model can be suitable for a mobile device having the memory limited, each output sample only relies on adjacent inputs, and original information and spatial arrangement information of the speech signal can be reserved well at less weight values. Also disclosed are an electronic device and a computer readable storage medium.
机译:本申请涉及人工智能领域。公开了一种基于全卷积神经网络的语音增强方法。该方法包括:建立全卷积神经网络模型,该全卷积神经网络包括输入层,隐藏层和输出层;隐藏层包括多个卷积层;多个卷积层中的每一个包括多个滤波器;训练全卷积神经网络模型;将原始语音信号输入经过训练的全卷积神经网络模型;输出增强的语音信号。在本申请的全卷积神经网络模型中,删除了全连接层,仅包含卷积层,从而大大降低了神经网络的参数,该全卷积神经网络模型可以适用于移动设备由于具有有限的存储器,每个输出样本仅依赖于相邻的输入,并且可以以较小的权重值很好地保留语音信号的原始信息和空间布置信息。还公开了一种电子设备和计算机可读存储介质。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号