Speech enhancement using MMSE estimation under phase uncertainty

Ravikumar Kandagatla; P. V. Subbaiah

首页> 外文期刊>International journal of speech technology >Speech enhancement using MMSE estimation under phase uncertainty

【24h】

Speech enhancement using MMSE estimation under phase uncertainty

机译：在相位不确定性下使用MMSE估计进行语音增强

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most of the speech enhancement algorithms process the amplitudes of speech, but the phase of noisy speech is left unprocessed as it may cause undesired artifacts. Recently, short time Fourier transform based single channel speech enhancement algorithms are developed by considering uncertain prior knowledge of phase. The uncertain knowledge of the phase is obtained from the phase reconstruction algorithms. The goal of this paper is to develop joint minimum mean square error estimate of complex speech coefficients given uncertainty phase (CUP) information by considering Nagakami probability density function (PDF) and gamma PDF as speech spectral amplitude priors and generalized gamma PDF for noise prior. Estimators like amplitudes given uncertainty phase, which uses uncertain phase only for amplitude estimation and not for phase improvement are developed. Experimental results shows that incorporating uncertain phase information improves quality and intelligibility of speech. Also novel phase-blind estimators are developed using Nagakami PDF/ gamma as speech priors and generalized gamma as noise prior. Finally comparison of all estimators using uncertain prior phase information is discussed and how initial phase information affects the enhancement process is analyzed with novel estimators. For comparison of all the derived estimators, the speech signals uttered by male and female speakers are taken from TIMIT database. The proposed CUP estimators outperforms the existing algorithms in terms of objective performance measure segmental signal to noise ratio, phase signal to noise ratio, perceptual evaluation of speech quality, short time objective intelligibility.

机译：大多数语音增强算法都处理语音幅度，但是嘈杂语音的相位则不予处理，因为它可能会导致不希望的伪像。最近，通过考虑不确定的相位先验知识，开发了基于短时傅立叶变换的单通道语音增强算法。相位的不确定性是从相位重建算法中获得的。本文的目标是通过将Nagakami概率密度函数（PDF）和gamma PDF视为语音频谱幅度先验值，并将广义gamma PDF作为噪声先验值，在给定不确定性相位（CUP）信息的情况下，开发复杂语音系数的联合最小均方误差估计。已开发出了类似幅度给定不确定性相位的估计器，该估计器仅将不确定性相位用于幅度估计而不用于相位改善。实验结果表明，合并不确定的相位信息可以提高语音质量和清晰度。还使用Nagakami PDF / gamma作为语音先验和广义gamma作为噪声先验开发了新颖的相位盲估计器。最后讨论了使用不确定的先验相位信息对所有估计器的比较，并使用新颖的估计器分析了初始相位信息如何影响增强过程。为了比较所有导出的估计量，从TIMIT数据库中提取了男性和女性说话者发出的语音信号。拟议的CUP估计器在目标性能测量，分段信噪比，相位信噪比，语音质量的感知评估，短时目标清晰度方面优于现有算法。

著录项

来源
《International journal of speech technology》 |2017年第2期|373-385|共13页
作者
Ravikumar Kandagatla; P. V. Subbaiah;
展开▼
作者单位

Lakireddy Baliredy Engineering College, Mylavaram, Krishna District, Andhra Pradesh, India;

Velagapudi Siddhartha Engineering College, Vijayawada, Krishna District, Andhra Pradesh, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech enhancement; Von misses distribution; Generalized gamma distribution; Noise reduction; Phase uncertainty;

机译：语音增强;冯·米斯分布;广义伽玛分布;降噪;相位不确定度;

相似文献

外文文献
中文文献
专利

1. On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty [J] . Martin Krawczyk-Becker, Timo Gerkmann Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：基于MMSE的相位不确定幅度和复语音频谱系数估计。
2. Efficient MMSE Estimation and Uncertainty Processing for Multienvironment Robust Speech Recognition [J] . González J.A., Peinado A.M., Gómez A.M., Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：多环境鲁棒语音识别的高效MMSE估计和不确定性处理
3. Speech Enhancement Using Modified MMSE-LSA and Phase Reconstruction in Voiced and Unvoiced Speech [J] . Jia Hairong, Wang Weimei, Wang Dong, International Journal of Pattern Recognition and Artificial Intelligence . 2019,第2期

机译：使用改进的MMSE-LSA进行语音增强和浊音和清音语音的相位重建
4. MMSE Log-Spectral Amplitude Estimation for Single Channel Speech Enhancement Under Speech Presence Uncertainty by Weibull Speech Priors [C] . Mojtaba Bahrami, Sanaz Seyedin Iranian Conference on Electrical Engineering . 2018

机译：Weibull语音先验在语音存在不确定性下用于单通道语音增强的MMSE对数谱幅度估计
5. Modulation domain processing and speech phase spectrum in speech enhancement. [D] . Zhang, Yi. 2012

机译：语音增强中的调制域处理和语音相位谱。
6. A Laplacian-based MMSE estimator for speech enhancement [O] . Bin Chen, Philipos C. Loizou -1

机译：基于Laplacian的MMSE估计器用于语音增强
7. Linking speech enhancement and error concealment based on recursive MMSE estimation [O] . Balázs Fodor, Florian Pflug, Tim Fingscheidt 2015

机译：基于递归MMSE估计的链接语音增强和错误隐藏

Speech enhancement using MMSE estimation under phase uncertainty

摘要

著录项

相似文献

相关主题

期刊订阅