首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation
【24h】

Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation

机译:利用频谱-时间相关性改进基于语音存在概率的噪声功率估计

获取原文
获取外文期刊封面目录资料

摘要

For the enhancement of speech degraded by noise, accurate estimation of the noise power spectral density (PSD) is indispensable, especially if only a single microphone signal is available. Fast and accurate tracking of the noise PSD is particularly challenging in highly non-stationary noise types, since the distinction between speech and noise components becomes more difficult. Short-time discrete Fourier transform (STFT) based noise PSD estimation algorithms which employ estimates of the speech presence probability (SPP) with fixed priors have been shown to yield good tracking performance even in adverse noise conditions. In this paper, we compare two methods to incorporate spectro-temporal correlations to improve the tracking performance. The first method smoothes the noisy observation over time and frequency before computing the SPP, while the second is based on a Hidden Markov Model (HMM) of the speech presence and absence states. We show that the proposed modifications lead to improved noise PSD estimators which are less sensitive to spectral outliers of the noise and track changes in the noise PSD more quickly than the reference method. Further, when employed in a common speech enhancement setup, the proposed estimators achieve an increased noise reduction while keeping speech distortions at a comparable level.
机译:为了增强因噪声而导致的语音质量下降,准确估计噪声功率谱密度(PSD)是必不可少的,尤其是在只有单个麦克风信号可用的情况下。在高度不稳定的噪声类型中,快速,准确地跟踪噪声PSD尤其具有挑战性,因为语音和噪声分量之间的区别变得更加困难。基于短时离散傅里叶变换(STFT)的噪声PSD估计算法已经证明,即使在不利的噪声条件下,该算法也可以使用具有固定先验的语音存在概率(SPP)的估计来产生良好的跟踪性能。在本文中,我们比较了两种方法,以结合光谱时空相关性以提高跟踪性能。第一种方法在计算SPP之前会在时间和频率上平滑噪声观测,而第二种方法则基于语音存在和不存在状态的隐马尔可夫模型(HMM)。我们表明,提出的修改导致改进的噪声PSD估计器,该估计器对噪声的频谱离群值较不敏感,并且比参考方法更快地跟踪了噪声PSD中的变化。此外,当在通用语音增强设置中使用时,所提出的估计器在将语音失真保持在可比较水平的同时实现了增加的噪声降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号