Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System

Hansen J.H.L.; Radhakrishnan V.; Arehart K.H.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System

【24h】

Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System

机译：基于广义最小均方误差估计器和听觉系统掩蔽特性的语音增强

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, the family of conditional minimum mean square error (MMSE) spectral estimators is studied which take on the form$(E(X_p^alpha/vert X_p+D_pvert))^1/alpha$, where$X_p$is the clean speech spectrum, and$D_p$is the noise spectrum, resulting in a Generalized MMSE estimator (GMMSE). The degree of noise suppression versus musical tone artifacts of these estimators is studied. The tradeoffs in selection of$(alpha)$, across noise spectral structure and signal-to-noise ratio (SNR) level, are also considered. Members of this family of estimators include the Ephraim–Malah (EM) amplitude estimator and, for high SNRs, the Wiener Filter. It is shown that the colorless residual noise observed in the EM estimator is a characteristic of this general family of estimators. An application of these estimators in an auditory enhancement scheme using the masking threshold of the human auditory system is formulated, resulting in the GMMSE-auditory masking threshold (AMT) enhancement method. Finally, a detailed evaluation of the proposed algorithms is performed over the phonetically balanced TIMIT database and the National Gallery of the Spoken Word (NGSW) audio archive using subjective and objective speech quality measures. Results show that the proposed GMMSE-AMT outperforms MMSE and log-MMSE enhancement methods using a detailed phoneme-based objective quality analysis.

机译：本文研究了条件最小均方误差（MMSE）谱估计量族，其形式为$ {E（X_p ^ alpha / vert X_p + D_pvert））^ 1 / alpha $，其中$ X_p $为干净的语音频谱，而$ D_p $是噪声频谱，从而得到广义MMSE估计器（GMMSE）。研究了这些估计器的噪声抑制程度与乐音伪像的关系。还考虑了在噪声频谱结构和信噪比（SNR）级别之间选择$α$的权衡。该估计器系列的成员包括Ephraim-Malah（EM）幅度估计器，以及对于高SNR的Wiener滤波器。结果表明，在EM估计器中观察到的无色残留噪声是该一般估计器系列的特征。制定了这些估计器在使用人类听觉系统掩蔽阈值的听觉增强方案中的应用，从而形成了GMMSE-听觉掩蔽阈值（AMT）增强方法。最后，使用主观和客观语音质量度量，在语音平衡的TIMIT数据库和国家美术馆的口语（NGSW）音频档案上对提出的算法进行了详细评估。结果表明，使用详细的基于音素的客观质量分析，所提出的GMMSE-AMT优于MMSE和log-MMSE增强方法。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2006年第6期|p.2049-2063|共15页
作者
Hansen J.H.L.; Radhakrishnan V.; Arehart K.H.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Auditory masked threshold (AMT); Weiner filter; denoising; generalized minimum mean square error (GMMSE)-AMT; log-minimum mean square error (MMSE); noise suppression; speech enhancement; Auditory masked threshold (AMT); Weiner filter; denoising; generalized minimu;

机译：听觉掩蔽阈值（AMT）;Weiner滤波器;去噪;广义最小均方误差（GMMSE）-AMT;对数最小均方误差（MMSE）;噪声抑制;语音增强;听觉掩蔽阈值（AMT）;Weiner滤波器;去噪广义最小;

相似文献

外文文献
中文文献
专利

1. Speech Enhancement Using Minimum Mean-Square Error Amplitude Estimators Under Normal and Generalized Gamma Distribution [J] . Chabane Boubakir, Daoud Berkani Journal of computer sciences . 2010,第7期

机译：在正态和广义Gamma分布下使用最小均方误差幅度估计器进行语音增强
2. Speech Enhancement Using Minimum Mean-Square Error Amplitude Estimators Under Normal and Generalized Gamma Distribution | Science Publications [J] . Chabane Boubakir, Daoud Berkani Journal of computer sciences . 2010,第7期

机译：正态和广义伽马分布下使用最小均方误差幅度估计器进行语音增强科学出版物
3. Minimum mean square error estimator for speech enhancement in additive noise assuming Weibull speech priors and speech presence uncertainty [J] . Mojtaba Bahrami, Neda Faraji International journal of speech technology . 2021,第1期

机译：威布尔语音Priors和语音存在不确定性的添加剂噪声语音增强的最小均方误差估计
4. MINIMUM MEAN-SQUARE ERROR AMPLITUDE ESTIMATORS FOR SPEECH ENHANCEMENT UNDER THE GENERALIZED GAMMA DISTRIBUTION [C] . R. C. Hendriks, J. S. Erkelens, J. Jensen, International Workshop on Acoustic Echo and Noise Control . 2006

机译：广义伽马分布下语音增强的最小均方误差估计
5. Speech enhancement algorithms using Kalman filtering and masking properties of human auditory systems. [D] . Ma, Ning. 2005

机译：使用卡尔曼滤波和人类听觉系统掩蔽属性的语音增强算法。
6. Reducing Bias and Mean Squared Error Associated With Regression-Based Odds Ratio Estimators [O] . Robert H. Lyles, Ying Guo, Sander Greenland -1

机译：减少与基于回归的差距估计相关的偏差和平均平方误差
7. Speech Enhancement Using Minimum Mean-Square Error Amplitude Estimators Under Normal and Generalized Gamma Distribution [O] . Chabane Boubakir, Daoud Berkani 2010

机译：在正态和广义Gamma分布下使用最小均方误差幅度估计器进行语音增强
8. Generalized Mean Squared Error Properties of Regression Estimators. [R] . Gunst, R. F., Mason, R. L. 1976

机译：回归估计的广义均方误差性质。

Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅