Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation

机译：利用频谱-时间相关性改进基于语音存在概率的噪声功率估计

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

For the enhancement of speech degraded by noise, accurate estimation of the noise power spectral density (PSD) is indispensable, especially if only a single microphone signal is available. Fast and accurate tracking of the noise PSD is particularly challenging in highly non-stationary noise types, since the distinction between speech and noise components becomes more difficult. Short-time discrete Fourier transform (STFT) based noise PSD estimation algorithms which employ estimates of the speech presence probability (SPP) with fixed priors have been shown to yield good tracking performance even in adverse noise conditions. In this paper, we compare two methods to incorporate spectro-temporal correlations to improve the tracking performance. The first method smoothes the noisy observation over time and frequency before computing the SPP, while the second is based on a Hidden Markov Model (HMM) of the speech presence and absence states. We show that the proposed modifications lead to improved noise PSD estimators which are less sensitive to spectral outliers of the noise and track changes in the noise PSD more quickly than the reference method. Further, when employed in a common speech enhancement setup, the proposed estimators achieve an increased noise reduction while keeping speech distortions at a comparable level.

机译：为了增强因噪声而导致的语音质量下降，准确估计噪声功率谱密度（PSD）是必不可少的，尤其是在只有单个麦克风信号可用的情况下。在高度不稳定的噪声类型中，快速，准确地跟踪噪声PSD尤其具有挑战性，因为语音和噪声分量之间的区别变得更加困难。基于短时离散傅里叶变换（STFT）的噪声PSD估计算法已经证明，即使在不利的噪声条件下，该算法也可以使用具有固定先验的语音存在概率（SPP）的估计来产生良好的跟踪性能。在本文中，我们比较了两种方法，以结合光谱时空相关性以提高跟踪性能。第一种方法在计算SPP之前会在时间和频率上平滑噪声观测，而第二种方法则基于语音存在和不存在状态的隐马尔可夫模型（HMM）。我们表明，提出的修改导致改进的噪声PSD估计器，该估计器对噪声的频谱离群值较不敏感，并且比参考方法更快地跟踪了噪声PSD中的变化。此外，当在通用语音增强设置中使用时，所提出的估计器在将语音失真保持在可比较水平的同时实现了增加的噪声降低。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2015年|365-369|共5页
会议地点
作者
Krawczyk-Becker Martin; Fischer Dorte; Gerkmann Timo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
noise power estimation; noise reduction; speech enhancement;

机译：噪声功率估计;降噪;语音增强;

相似文献

外文文献
中文文献
专利

1. Improved speech absence probability estimation based on environmental noise classification [J] . SON Young-ho, LEE Sang-min 中南大学学报（英文版） . 2012,第009期

机译：基于环境噪声分类的改进语音缺席概率估计
2. Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors [J] . Gerkmann T., Breithaupt C., Martin R. IEEE transactions on audio, speech and language processing . 2008,第5期

机译：基于固定先验的似然比的改进后验语音存在概率估计
3. Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors [J] . Gerkmann T., Breithaupt C., Martin R. IEEE transactions on audio, speech and language processing . 2008,第5期

机译：基于固定先验的似然比的改进后验语音存在概率估计
4. Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation [C] . Krawczyk-Becker Martin, Fischer Dorte, Gerkmann Timo IEEE International Conference on Acoustics, Speech and Signal Processing . 2015

机译：利用用于改进的语音存在概率的噪声功率估计的光谱 - 时间相关性
5. Array-based Spectro-temporal Masking for Automatic Speech Recognition. [D] . Moghimi, Amir R. 2014

机译：基于阵列的频谱时域掩蔽，用于自动语音识别。
6. Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users [O] . Tobias Goehring, Federico Bolner, Jessica J.M. Monaghan, -1

机译：基于神经网络的语音增强功能可提高人工耳蜗用户的语音清晰度
7. Noise power estimation based on the probability of speech presence [O] . Timo Gerkmann, Richard C. Hendriks 2011

机译：基于语音存在概率的噪声功率估计
8. Optimum Classification of Voiced Speech, Unvoiced Speech and Silence in the Presence of Noise and Interference. [R] . mcaulay,robert j. -1

机译：浊音的最佳分类，清音和沉默的噪音和干扰的情况下。

Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅