
SOUND MODELING AND VOICE RECOGNIZING METHOD FOR LONG-DISTANCE VOICE RECOGNITION BASED ON MULTI-INPUT DEEP NEURAL NETWORK



Abstract

To address the degradation of recognition performance caused by reverberation in long-distance voice recognition, the present invention provides a deep neural network with multi-segment input for handling reverberation components that vary with the environment, including the acoustic space and the microphone distance, and a distance-independent voice modeling method using that network. According to the present invention, segments containing different numbers of frames are formed to produce a plurality of input segments. Convolution matrices of different sizes are applied to the segments so that every segment yields an output of the same size. Once the outputs have the same size, the outputs of all segments are compared and the maximum value is taken to build the sound model. COPYRIGHT KIPO 2018
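The abstract gives no dimensions or layer details, so the following is only a minimal NumPy sketch of the idea it describes: segments of different lengths, a differently sized convolution kernel per segment chosen so that all outputs share one shape, and an element-wise maximum across segments. The segment lengths (5, 11, 21 frames), the 40-dimensional features, the output length of 3, the 8 feature maps, and all function names are illustrative assumptions, not values or names taken from the patent.

```python
import numpy as np

def center_segment(features, center, length):
    """Return `length` frames around `center`, repeating edge frames as padding."""
    half = length // 2
    idx = np.arange(center - half, center - half + length)
    idx = np.clip(idx, 0, len(features) - 1)
    return features[idx]

def valid_conv_time(segment, kernel):
    """'Valid' convolution along time: (L, D) with (K, D, F) -> (L - K + 1, F).

    As in most neural-network toolkits, the kernel is not flipped
    (strictly speaking this is a cross-correlation).
    """
    k_len = kernel.shape[0]
    out_len = segment.shape[0] - k_len + 1
    return np.stack([
        np.tensordot(segment[t:t + k_len], kernel, axes=([0, 1], [0, 1]))
        for t in range(out_len)
    ])

def multi_segment_max(features, center, segment_lengths, kernels):
    """Convolve each segment with its own kernel, then take the element-wise max."""
    outputs = [
        valid_conv_time(center_segment(features, center, length), kernel)
        for length, kernel in zip(segment_lengths, kernels)
    ]
    if len({out.shape for out in outputs}) != 1:
        raise ValueError("kernel sizes must give every segment the same output shape")
    return np.max(np.stack(outputs), axis=0)

# Illustrative setup (all values are assumptions, not from the patent).
rng = np.random.default_rng(0)
features = rng.standard_normal((100, 40))   # 100 frames, 40-dim features
segment_lengths = (5, 11, 21)               # short, medium, long context
out_len, n_maps = 3, 8
# Kernel length L - out_len + 1 makes every segment produce out_len frames.
kernels = [
    0.1 * rng.standard_normal((length - out_len + 1, 40, n_maps))
    for length in segment_lengths
]

pooled = multi_segment_max(features, 50, segment_lengths, kernels)
print(pooled.shape)  # (3, 8)
```

In this sketch the pooled map would feed the remaining layers of the acoustic model. Choosing the kernel length as segment length minus output length plus one is simply one way to make the per-segment outputs comparable; the patent may equalize the sizes differently.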


Bibliographic details

Publication number: KR20180084464A

Patent type:
Publication date: 2018-07-25

Original format: PDF
Applicant/patentee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Application number: KR20170008101
Inventors: JUNG HO YOUNG (KR); PARK KI YOUNG (KR)

Filing date: 2017-01-17
Classification: G10L15/20; G10L15/18
Country: KR
Database entry time: 2022-08-21 12:39:27
