首页> 外国专利> SOUND MODELING AND VOICE RECOGNIZING METHOD FOR LONG-DISTANCE VOICE RECOGNITION BASED ON MULTI-INPUT DEEP NEURAL NETWORK

SOUND MODELING AND VOICE RECOGNIZING METHOD FOR LONG-DISTANCE VOICE RECOGNITION BASED ON MULTI-INPUT DEEP NEURAL NETWORK

机译:基于多输入深度神经网络的长途语音识别的声音建模与语音识别方法

摘要

To solve the deterioration of recognition performance caused by reverberations in regard to long-distance voice recognition, the present invention provides a deep neural network having multi-segment input for processing a reverberation component changing depending on various environments including a spatial space, a microphone distance, and the like, and a distance-independent voice modeling method using the same. According to the present invention, for long-distance voice recognition, segments having frames of various lengths are formed to form a plurality of multi-input segments, and convolution matrixes having different sizes are applied to each of the segments to obtain output having the same capacity for each of the segments, and then, if convolution matrixes having the same size are outputted, convolution matrixes of all of the segments are compared to obtain a maximum value for building a sound model.;COPYRIGHT KIPO 2018
机译:为了解决关于长距离语音识别的混响引起的识别性能的劣化,本发明提供了一种具有多段输入的深度神经网络,用于处理混响分量的变化,该混响分量根据包括空间空间,麦克风距离在内的各种环境而变化。等等,以及使用该方法的距离无关的语音建模方法。根据本发明,对于长途语音识别,形成具有各种长度的帧的段以形成多个多输入段,并且将具有不同大小的卷积矩阵应用于每个段以获得具有相同大小的输出。每个段的容量,然后,如果输出具有相同大小的卷积矩阵,则将所有段的卷积矩阵进行比较以获得构建声音模型的最大值。; COPYRIGHT KIPO 2018

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号