首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement
【24h】

Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement

机译:用于语音增强的非平稳噪声功率谱的鲁棒估计

获取原文
获取原文并翻译 | 示例

摘要

We propose a novel method for noise power spectrum estimation in speech enhancement. This method called extended-DATE (E-DATE) extends the -dimensional amplitude trimmed estimator (DATE), originally introduced for additive white gaussian noise power spectrum estimation in “Robust estimation of noise standard deviation in presence of signals with unknown distributions and occurrences” (D. Pastor and F. Socheleau, IEEE Trans. Signal Processing, vol. 60, no. 4, pp. 1545–1555, Apr. 2012) to the more challenging scenario of non-stationary noise. The key idea is that, in each frequency bin and within a sufficiently short time period, the noise instantaneous power spectrum can be considered as approximately constant and estimated as the variance of a complex gaussian noise process possibly observed in the presence of the signal of interest. The proposed method relies on the fact that the Short-Time Fourier Transform (STFT) of noisy speech signals is sparse in the sense that transformed speech signals can be represented by a relatively small number of coefficients with large amplitudes in the time-frequency domain. The E-DATE estimator is robust in that it does not require prior information about the signal probability distribution except for the weak-sparseness property. In comparison to other state-of-the-art methods, the E-DATE is found to require the smallest number of parameters (only two). The performance of the proposed estimator has been evaluated in combination with noise reduction and compared to alternative methods. This evaluation involves objective as well as pseudo-subjective criteria.
机译:我们提出了一种语音增强中噪声功率谱估计的新方法。这种称为扩展日期(E-DATE)的方法扩展了维度幅度修剪估计器(DATE),该方法最初是为了在“存在未知分布和出现的信号的情况下对噪声标准偏差进行鲁棒估计”而引入的,用于加性高斯白噪声功率谱估计。 (D. Pastor和F. Socheleau,IEEE Trans。Signal Processing,第60卷,第4期,第1545-1555页,2012年4月)针对更具挑战性的非平稳噪声场景。关键思想是,在每个频率仓中且在足够短的时间段内,噪声瞬时功率谱可被视为近似恒定,并估计为存在感兴趣信号时可能观察到的复杂高斯噪声过程的方差。所提出的方法基于这样的事实,即在可以在时频域中用相对少量的具有大幅度的系数来表示变换后的语音信号的情况下,嘈杂语音信号的短时傅立叶变换(STFT)稀疏。 E-DATE估计器具有鲁棒性,因为它除了弱稀疏特性外,不需要有关信号概率分布的先验信息。与其他最新方法相比,发现E-DATE需要最少数量的参数(仅两个)。已结合降噪评估了拟议估算器的性能,并与替代方法进行了比较。该评估涉及客观以及伪主观标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号