...
首页> 外文期刊>Communications, China >Single-channel speech enhancement based on improved frame-iterative spectral subtraction in the modulation domain
【24h】

Single-channel speech enhancement based on improved frame-iterative spectral subtraction in the modulation domain

机译:基于改进的调制域中帧迭代谱减法的单通道语音增强

获取原文
获取原文并翻译 | 示例
           

摘要

Aiming at the problem of music noise introduced by classical spectral subtraction, a short-time modulation domain (STM) spectral subtraction method has been successfully applied for single-channel speech enhancement. However, due to the inaccurate voice activity detection (VAD), the residual music noise and enhanced performance still need to be further improved, especially in the low signal to noise ratio (SNR) scenarios. To address this issue, an improved frame iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, with the inter-frame correlation, the noise subtraction is directly applied to handle the noisy signal for each frame in the STM domain. Then, the noisy signal is classified into speech or silence frames based on a predefined threshold of segmented SNR. With these classification results, a corresponding mask function is developed for noisy speech after noise subtraction. Finally, exploiting the increased sparsity of speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is employed to the speech frames for improving the speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise, including white noise, pink noise, and hfchannel noise. The obtained results show that the proposed method outperforms some established baselines at lower SNRs (5 to +5 dB).
机译:针对经典光谱减法引入的音乐噪声问题,已经成功地应用了短时间调制域(STM)光谱减法方法以进行单通道语音增强。然而,由于语音活动检测(VAD)不准确,还需要进一步改善剩余的音乐噪声和增强的性能,尤其是在低信噪比(SNR)场景中。为了解决这个问题,提出了STM域(Immodssub)中的改进的帧迭代光谱减法。更具体地,利用帧间相关性,直接噪声减法以处理STM域中的每个帧的噪声信号。然后,基于分段SNR的预定阈值,将噪声信号分为语音或沉默帧。利用这些分类结果,在噪声减法之后开发了相应的掩模功能以进行嘈杂的语音。最后,利用调制域中的语音信号的增加的稀疏性,正交匹配追踪(OMP)技术用于语音帧以提高语音质量和可懂度。所提出的方法的有效性被三种类型的噪声评估,包括白噪声,粉红色噪声和HFChannel噪声。所得结果表明,该方法优于较低的SNR(5至+ 5 dB)的一些建立的基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号