首页> 外国专利> SINGLE-SOUND CHANNEL ROBUSTNESS SPEECH KEYWORD REAL-TIME DETECTION METHOD

SINGLE-SOUND CHANNEL ROBUSTNESS SPEECH KEYWORD REAL-TIME DETECTION METHOD

机译：单声道频道鲁棒性语音关键字实时检测方法

页面导航

摘要
著录项
相似文献

摘要

A single-sound channel robustness speech keyword real-time detection method, comprising the following steps: receiving noisy speech of an electronic format; converting a time domain speech signal into a frequency domain signal by means of short-time Fourier transform in a frame-by-frame mode; using a Mel filter to process the frequency domain signal so as to obtain a Mel feature as an acoustic feature; making the Mel feature pass a neural network in a frame-by-frame mode, and then using a normalized exponential function to process the Mel feature to obtain the confidence degree information of each keyword; when the confidence degree information of a certain keyword is greater than a predefined threshold, splicing the current frame and previous several frames so as to be used as an output of the neural network; and sequentially passing through an attention mechanism and a feed-forward type deep neural network, and performing processing by means of the normalized exponential function so as to obtain the confidence degree information of each sentence-level keyword, when a confidence degree value is greater than the predefined threshold, considering that the keyword is detected, and otherwise, considering the keyword is not detected. The method still can keep a high wakeup rate in a noisy environment, has wide applicability, and can greatly reduce the false alarm rate of the neural network and improve the detection performance of the keyword.

机译：单声道频道鲁棒性语音关键字实时检测方法，包括以下步骤：接收电子格式的嘈杂语音;通过帧帧模式下的短时傅里叶变换将时域语音信号转换为频域信号;使用MEL滤波器处理频域信号，以便获得MEL功能作为声学特征;使MEL功能通过帧帧模式通过神经网络，然后使用归一化指数函数来处理MEL功能以获得每个关键字的置信度信息;当某个关键字的置信度信息大于预定义阈值时，拼接当前帧和先前的几帧，以便用作神经网络的输出;顺序地通过注意机制和前馈类型深神经网络，并通过归一化指数函数进行处理，以便在置信度值大于时获得每个句子级关键字的置信度信息。考虑检测到关键字的预定义阈值，否则，考虑未检测到关键字。该方法仍然可以在嘈杂的环境中保持高唤醒率，具有广泛的适用性，并且可以大大降低神经网络的误报率，提高关键字的检测性能。

著录项

公开/公告号WO2021062705A1

专利类型
公开/公告日2021-04-08

原文格式PDF
申请/专利权人 ELEVOC TECHNOLOGY CO. LTD.;
展开▼

申请/专利号WO2019CN109603
发明设计人 HU PENG;YAN YONGJIE;
展开▼

申请日2019-09-30
分类号G10L15/02;G10L15/14;
国家 CN
入库时间 2024-06-14 21:24:48

相似文献

专利
外文文献
中文文献