...
首页> 外文期刊>Signal processing >Adaptive β-order Generalized Spectral Subtraction For Speech Enhancement
【24h】

Adaptive β-order Generalized Spectral Subtraction For Speech Enhancement

机译:自适应β阶广义谱减法用于语音增强

获取原文
获取原文并翻译 | 示例
           

摘要

The performance degradation of speech communication systems in noisy environments inspired increasing research on speech enhancement and noise reduction. As a well-known single-channel noise reduction technique, spectral subtraction (SS) has widely been used for speech enhancement. However, the spectral order β set in SS is always fixed to some constants, resulting in performance limitation to a certain degree. In this paper, we first analyze the performance of the β-order generalized spectral subtraction (GSS) in terms of the gain function to highlight its dependence on the value of spectral order β. A data-driven optimization scheme is then introduced to quantitatively determine the change of β with the change of the input signal-to-noise ratio (SNR). Based on the analysis results and considering the non-uniform effect of real-world noise on speech signal, we propose an adaptive β-order GSS in which the spectral order β is adaptively updated according to the local SNR in each critical band frame by frame as in a sigmoid function. The performance of the proposed adaptive β-order GSS is finally evaluated objectively by segmental SNR (SEGSNR) and log-spectral distance (LSD), and subjectively by spectrograms and mean opinion score (MOS), using comprehensive experiments in various noise conditions. Experimental results show that the proposed algorithm yields an average SEGSNR increase of 2.99 dB and an average LSD reduction of 2.71 dB, which are much larger improvement than that obtained with the competing SS algorithms. The superiority of the proposed algorithm is also demonstrated by the highest MOS ratings obtained from the listening tests.
机译:在嘈杂的环境中,语音通​​信系统的性能下降促使人们对语音增强和降噪进行了越来越多的研究。作为一种众所周知的单通道降噪技术,频谱减法(SS)已被广泛用于语音增强。然而,在SS中设置的频谱阶数β总是固定为一些常数,从而在一定程度上导致性能限制。在本文中,我们首先根据增益函数分析了β级广义谱减法(GSS)的性能,以突出其对谱级β值的依赖性。然后引入数据驱动的优化方案,以定量确定随输入信噪比(SNR)的变化而变化的β。基于分析结果并考虑现实噪声对语音信号的非均匀影响,我们提出了一种自适应β阶GSS,其中根据每个关键频带的局部信噪比逐帧自适应更新频谱阶β就像在S型函数中一样。最后,通过在各种噪声条件下进行综合实验,通过分段SNR(SEGSNR)和对数谱距离(LSD)客观地评估提出的自适应β阶GSS的性能,并通过频谱图和平均观点得分(MOS)来主观地评估所提出的自适应β阶GSS的性能。实验结果表明,该算法产生的SEGSNR平均提高了2.99 dB,平均LSD降低了2.71 dB,这比竞争SS算法获得的改进要大得多。从收听测试中获得的最高MOS额定值也证明了所提出算法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号