首页> 外国专利> METHOD AND APPARATUS FOR SPEECH END-POINT DETECTION USING ACOUSTIC AND LANGUAGE MODELING KNOWLEDGE FOR ROBUST SPEECH RECOGNITION

METHOD AND APPARATUS FOR SPEECH END-POINT DETECTION USING ACOUSTIC AND LANGUAGE MODELING KNOWLEDGE FOR ROBUST SPEECH RECOGNITION

机译:使用声学和语言建模知识进行语音终点检测的方法和装置,用于鲁棒语音识别

摘要

A method and apparatus for detecting a voice endpoint using acoustic and language modeling information for robust voice recognition are presented. The method for detecting a voice endpoint according to an embodiment includes a Recurrent Neural Network (RNN)-based acoustic embedding extractor, a phoneme embedding extractor, and a decoder embedding extractor that inputs an acoustic feature vector sequence extracted from a microphone input signal. step; constructing a feature vector by combining sound embedding, phoneme embedding, and decoder embedding in the sound embedding extractor, the phoneme embedding extractor, and the decoder embedding extractor; and inputting the combined feature vector into a deep neural network (DNN)-based classifier to detect a voice endpoint.
机译:呈现用于使用用于鲁棒语音识别的声学和语言建模信息来检测语音端点的方法和装置。 根据实施例的用于检测语音端点的方法包括经常性神经网络(RNN)的声学嵌入提取器,音素嵌入提取器和解码器嵌入提取器,其输入从麦克风输入信号提取的声学特征向量序列。 步; 通过将声音嵌入,音素嵌入和解码器嵌入声音嵌入提取器,音素嵌入提取器和解码器嵌入提取器中的声音嵌入,音素嵌入和解码器构建特征矢量; 并将组合的特征向量输入到深度神经网络(DNN)的基础分类器中以检测语音端点。

著录项

  • 公开/公告号KR102305672B1

    专利类型

  • 公开/公告日2021-09-28

    原文格式PDF

  • 申请/专利权人 한양대학교 산학협력단;

    申请/专利号KR20190086305

  • 发明设计人 장준혁;황인영;

    申请日2019-07-17

  • 分类号G10L25/87;G06N3/02;G10L15/06;G10L15/183;G10L19/038;G10L25/30;

  • 国家 KR

  • 入库时间 2022-08-24 21:19:06

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号