首页> 外国专利> METHOD AND APPARATUS FOR SPEECH ENDPOINT DETECTION BASED ON JOINTLY TRAINED DEEP NEURAL NETWORKS FOR COMBINING ACOUSTIC EMBEDDING WITH CONTEXT OF AUTOMATIC SPEECH RECOGNITION

METHOD AND APPARATUS FOR SPEECH ENDPOINT DETECTION BASED ON JOINTLY TRAINED DEEP NEURAL NETWORKS FOR COMBINING ACOUSTIC EMBEDDING WITH CONTEXT OF AUTOMATIC SPEECH RECOGNITION

机译：基于联合训练的深度神经网络结合语音嵌入和自动语音识别的语音端点检测方法和装置

页面导航

摘要
著录项
相似文献

摘要

A method and apparatus for detecting a voice endpoint based on a deep neural network that combines learning acoustic feature vector embedding and voice recognition context are presented. According to an embodiment, a method for detecting a voice endpoint based on a deep neural network may include inputting an acoustic feature vector sequence extracted from a microphone input signal to a first deep neural network (DNN) model and a second deep neural network model; And detecting a voice endpoint through a density layer by combining the hidden state of the last hidden layer of the first deep neural network model and the second deep neural network model.

机译：提出了一种基于深度神经网络的语音端点检测方法和装置，该方法结合了学习声学特征向量嵌入和语音识别上下文。根据一个实施例，一种基于深度神经网络的语音端点检测方法可以包括：将从麦克风输入信号中提取的声学特征矢量序列输入到第一深度神经网络模型和第二深度神经网络模型。通过结合第一深度神经网络模型和第二深度神经网络模型的最后隐藏层的隐藏状态，通过密度层检测语音端点。

著录项

公开/公告号KR20200101495A

专利类型
公开/公告日2020-08-28

原文格式PDF
申请/专利权人 한양대학교 산학협력단;
展开▼

申请/专利号KR20190010972
发明设计人 장준혁;황인영;
展开▼

申请日2019-01-29
分类号G10L25/87;G06N3/02;G10L15/04;G10L15/14;G10L15/187;G10L25/30;
国家 KR
入库时间 2022-08-21 11:06:10

相似文献

专利
外文文献
中文文献