Improved Signal-to-Noise Ratio Estimation for Speech Enhancement

Plapous C.; Marro C.; Scalart P.

Abstract

This paper addresses the problem of single-microphone speech enhancement in noisy environments. State-of-the-art short-time noise reduction techniques are most often expressed as a spectral gain depending on the signal-to-noise ratio (SNR). The well-known decision-directed (DD) approach drastically limits the level of musical noise, but the estimated a priori SNR is biased since it depends on the speech spectrum estimation in the previous frame. Therefore, the gain function matches the previous frame rather than the current one, which degrades the noise reduction performance. The consequence of this bias is an annoying reverberation effect. We propose a method called the two-step noise reduction (TSNR) technique, which solves this problem while maintaining the benefits of the decision-directed approach. The estimation of the a priori SNR is refined by a second step to remove the bias of the DD approach, thus removing the reverberation effect. However, classic short-time noise reduction techniques, including TSNR, introduce harmonic distortion in enhanced speech because of the unreliability of estimators for small signal-to-noise ratios. This is mainly due to the difficult task of noise power spectral density (PSD) estimation in single-microphone schemes. To overcome this problem, we propose a method called harmonic regeneration noise reduction (HRNR). A nonlinearity is used to regenerate the degraded harmonics of the distorted signal in an efficient way. The resulting artificial signal is produced in order to refine the a priori SNR used to compute a spectral gain able to preserve the speech harmonics. These methods are analyzed, and objective and formal subjective test results comparing the HRNR and TSNR techniques are provided. HRNR brings a significant improvement over TSNR thanks to the preservation of harmonics.
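
The abstract gives no formulas, so the following is only a minimal sketch of how the three estimators it names (DD, TSNR, HRNR) could be chained for a single frame. Several things here are assumptions, not taken from this record: the Wiener form of the gain, the smoothing factor `beta = 0.98`, half-wave rectification as the nonlinearity, the use of the TSNR gain as the mixing weight, and the function names `dd_tsnr_gains` and `hrnr_gain`, which are hypothetical.

```python
import numpy as np


def dd_tsnr_gains(noisy_power, prev_speech_power, noise_psd, beta=0.98):
    """Return (gain_dd, gain_tsnr) for one frame; inputs are per-bin arrays."""
    noise_psd = np.maximum(noise_psd, 1e-12)  # guard against division by zero
    snr_post = noisy_power / noise_psd        # a posteriori SNR

    # Step 1: decision-directed a priori SNR, dominated by the previous frame.
    snr_prio_dd = (beta * prev_speech_power / noise_psd
                   + (1.0 - beta) * np.maximum(snr_post - 1.0, 0.0))
    gain_dd = snr_prio_dd / (1.0 + snr_prio_dd)  # Wiener gain (assumed form)

    # Step 2: re-estimate the a priori SNR from the current frame after
    # filtering it with gain_dd, which removes the one-frame lag.
    snr_prio_tsnr = (gain_dd ** 2) * noisy_power / noise_psd
    gain_tsnr = snr_prio_tsnr / (1.0 + snr_prio_tsnr)
    return gain_dd, gain_tsnr


def hrnr_gain(enhanced_frame, gain_tsnr, noise_psd):
    """Refine the gain using a harmonically regenerated copy of the frame."""
    noise_psd = np.maximum(noise_psd, 1e-12)
    # Half-wave rectification: one possible nonlinearity (an assumption here)
    # that recreates energy at the harmonics of a periodic signal.
    harmonic_frame = np.maximum(enhanced_frame, 0.0)
    speech_power = np.abs(np.fft.rfft(enhanced_frame)) ** 2
    harmonic_power = np.abs(np.fft.rfft(harmonic_frame)) ** 2
    # Mix both spectra; weighting by gain_tsnr is an assumed choice.
    snr_prio_hrnr = (gain_tsnr * speech_power
                     + (1.0 - gain_tsnr) * harmonic_power) / noise_psd
    return snr_prio_hrnr / (1.0 + snr_prio_hrnr)
```

In this sketch, `enhanced_frame` is a time-domain frame of length N that has already been filtered with the TSNR gain, while `gain_tsnr` and `noise_psd` hold N // 2 + 1 values to match the output of `np.fft.rfft`. Windowing, overlap-add and noise PSD tracking are left out.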

Bibliographic details

Source: IEEE Transactions on Audio, Speech, and Language Processing, 2006, no. 6, pp. 2098-2108 (11 pages)
Authors: Plapous C.; Marro C.; Scalart P.
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords: A posteriori signal-to-noise ratio (SNR); a priori SNR; harmonic regeneration; noise reduction; speech enhancement

Similar literature

1. On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement [J]. Tran Huy Dat, Kazuya Takeda, Fumitada Itakura. Speech Communication, 2006, no. 11.
2. Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal [J]. Suman Samui, Indrajit Chakrabarti, Soumya Kanti Ghosh. Signal Processing, IET, 2016, no. 6.
3. A Multi-Filter System for Speech Enhancement under Low Signal-to-Noise Ratios [J]. K. F. C. Yiu, K. Y. Chan, S. Y. Low. Journal of Industrial and Management Optimization, 2009, no. 3.
4. Truth-to-Estimate Ratio Mask: A Post-Processing Method for Speech Enhancement Direct at Low Signal-to-Noise Ratios [C]. Bohan Chen, He Wang, Yue Wei. IEEE International Conference on Acoustics, Speech and Signal Processing, 2020.
5. The signal-to-noise ratio estimation in dispersive absorption spectrometry and new quantitative methods based on the signal-to-noise ratio theory [D]. Fu, Chifan Thomas. 1998.
6. The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility [O]. Thomas Bentsen, Tobias May, Abigail A. Kressner. 2012.
7. Improved Signal-to-Noise Ratio Estimation for Speech Enhancement [O]. Plapous, Cyril; Marro, Claude; Scalart, Pascal. 2006.
