首页> 外文会议>International Conference on Signal Processing, Communication, Power and Embedded System >Improving single frequency filtering based Voice Activity Detection (VAD) using spectral subtraction based noise cancellation
【24h】

Improving single frequency filtering based Voice Activity Detection (VAD) using spectral subtraction based noise cancellation

机译:使用基于频谱减法的噪声消除来改善基于单频滤波的语音活动检测(VAD)

获取原文
获取原文并翻译 | 示例

摘要

Voice Activity Detection (VAD) is a basic front end step in all of the speech processing engines. There have been proposed many Time domain and Frequency domain algorithms with different computational complexity. Common artifacts in VAD are Front End Clipping (FEC), Mid Speech Clipping (MSC), over clipping and Noise detected as Speech (NDS). Performance of VAD is dependent on SNR between speech and background noise. Background noise could be either a wide band noise spanning entire speech frequency band or it might be a narrowband noise which does not interfere with the speech frequency band. Simple VAD methods based on short time energy and zero crossing detection fails to discriminate between speech frame and noise frame in low SNR environment. The frequency domain methods have higher accuracy in low SNR environments. In Single Frequency Filtering (SFF) approach instead of computing the complete FFT of the given audio frame, the power envelope of the spectrum at a discrete frequency interval 20Hz in the speech band of 20Hz to 4KHz are computed. The accuracy of the SFF VAD can be improved by applying an adaptive Noise Canceller prior to SFF VAD. The VAD decision of SFF approach at the transition from speech to noise and vice versa is improved to a remarkable extent by using an adaptive noise cancellation. Evaluation of SFF approach was done with Adaptive Spectral Subtraction based Noise Cancellation prior to SFF VAD.
机译:语音活动检测(VAD)是所有语音处理引擎中的基本前端步骤。已经提出了许多具有不同计算复杂度的时域和频域算法。 VAD中常见的伪像是前端削波(FEC),中间语音削波(MSC),过度削波和噪声检测为语音(NDS)。 VAD的性能取决于语音和背景噪声之间的SNR。背景噪声可以是跨越整个语音频带的宽带噪声,也可以是不干扰语音频带的窄带噪声。基于短时能量和零交叉检测的简单VAD方法无法在低SNR环境中区分语音帧和噪声帧。频域方法在低SNR环境中具有更高的精度。在单频滤波(SFF)方法中,不是计算给定音频帧的完整FFT,而是计算20Hz至4KHz语音频带中离散频率间隔20Hz的频谱功率包络。通过在SFF VAD之前应用自适应降噪器,可以提高SFF VAD的精度。通过使用自适应噪声消除,从语音到噪声反之亦然的SFF方法的VAD决策得到了显着改善。 SFF方法的评估是在SFF VAD之前使用基于自适应谱减的噪声消除方法完成的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号