首页> 外国专利> SINGLE-SOUND CHANNEL ROBUSTNESS SPEECH KEYWORD REAL-TIME DETECTION METHOD

SINGLE-SOUND CHANNEL ROBUSTNESS SPEECH KEYWORD REAL-TIME DETECTION METHOD

机译:单声道频道鲁棒性语音关键字实时检测方法

摘要

A single-sound channel robustness speech keyword real-time detection method, comprising the following steps: receiving noisy speech of an electronic format; converting a time domain speech signal into a frequency domain signal by means of short-time Fourier transform in a frame-by-frame mode; using a Mel filter to process the frequency domain signal so as to obtain a Mel feature as an acoustic feature; making the Mel feature pass a neural network in a frame-by-frame mode, and then using a normalized exponential function to process the Mel feature to obtain the confidence degree information of each keyword; when the confidence degree information of a certain keyword is greater than a predefined threshold, splicing the current frame and previous several frames so as to be used as an output of the neural network; and sequentially passing through an attention mechanism and a feed-forward type deep neural network, and performing processing by means of the normalized exponential function so as to obtain the confidence degree information of each sentence-level keyword, when a confidence degree value is greater than the predefined threshold, considering that the keyword is detected, and otherwise, considering the keyword is not detected. The method still can keep a high wakeup rate in a noisy environment, has wide applicability, and can greatly reduce the false alarm rate of the neural network and improve the detection performance of the keyword.
机译:单声道频道鲁棒性语音关键字实时检测方法,包括以下步骤:接收电子格式的嘈杂语音;通过帧帧模式下的短时傅里叶变换将时域语音信号转换为频域信号;使用MEL滤波器处理频域信号,以便获得MEL功能作为声学特征;使MEL功能通过帧帧模式通过神经网络,然后使用归一化指数函数来处理MEL功能以获得每个关键字的置信度信息;当某个关键字的置信度信息大于预定义阈值时,拼接当前帧和先前的几帧,以便用作神经网络的输出;顺序地通过注意机制和前馈类型深神经网络,并通过归一化指数函数进行处理,以便在置信度值大于时获得每个句子级关键字的置信度信息。考虑检测到关键字的预定义阈值,否则,考虑未检测到关键字。该方法仍然可以在嘈杂的环境中保持高唤醒率,具有广泛的适用性,并且可以大大降低神经网络的误报率,提高关键字的检测性能。

著录项

  • 公开/公告号WO2021062705A1

    专利类型

  • 公开/公告日2021-04-08

    原文格式PDF

  • 申请/专利权人 ELEVOC TECHNOLOGY CO. LTD.;

    申请/专利号WO2019CN109603

  • 发明设计人 HU PENG;YAN YONGJIE;

    申请日2019-09-30

  • 分类号G10L15/02;G10L15/14;

  • 国家 CN

  • 入库时间 2024-06-14 21:24:48

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号