...
首页> 外文期刊>International journal of speech technology >Speech enhancement using MMSE estimation under phase uncertainty
【24h】

Speech enhancement using MMSE estimation under phase uncertainty

机译:在相位不确定性下使用MMSE估计进行语音增强

获取原文
获取原文并翻译 | 示例
           

摘要

Most of the speech enhancement algorithms process the amplitudes of speech, but the phase of noisy speech is left unprocessed as it may cause undesired artifacts. Recently, short time Fourier transform based single channel speech enhancement algorithms are developed by considering uncertain prior knowledge of phase. The uncertain knowledge of the phase is obtained from the phase reconstruction algorithms. The goal of this paper is to develop joint minimum mean square error estimate of complex speech coefficients given uncertainty phase (CUP) information by considering Nagakami probability density function (PDF) and gamma PDF as speech spectral amplitude priors and generalized gamma PDF for noise prior. Estimators like amplitudes given uncertainty phase, which uses uncertain phase only for amplitude estimation and not for phase improvement are developed. Experimental results shows that incorporating uncertain phase information improves quality and intelligibility of speech. Also novel phase-blind estimators are developed using Nagakami PDF/ gamma as speech priors and generalized gamma as noise prior. Finally comparison of all estimators using uncertain prior phase information is discussed and how initial phase information affects the enhancement process is analyzed with novel estimators. For comparison of all the derived estimators, the speech signals uttered by male and female speakers are taken from TIMIT database. The proposed CUP estimators outperforms the existing algorithms in terms of objective performance measure segmental signal to noise ratio, phase signal to noise ratio, perceptual evaluation of speech quality, short time objective intelligibility.
机译:大多数语音增强算法都处理语音幅度,但是嘈杂语音的相位则不予处理,因为它可能会导致不希望的伪像。最近,通过考虑不确定的相位先验知识,开发了基于短时傅立叶变换的单通道语音增强算法。相位的不确定性是从相位重建算法中获得的。本文的目标是通过将Nagakami概率密度函数(PDF)和gamma PDF视为语音频谱幅度先验值,并将广义gamma PDF作为噪声先验值,在给定不确定性相位(CUP)信息的情况下,开发复杂语音系数的联合最小均方误差估计。已开发出了类似幅度给定不确定性相位的估计器,该估计器仅将不确定性相位用于幅度估计而不用于相位改善。实验结果表明,合并不确定的相位信息可以提高语音质量和清晰度。还使用Nagakami PDF / gamma作为语音先验和广义gamma作为噪声先验开发了新颖的相位盲估计器。最后讨论了使用不确定的先验相位信息对所有估计器的比较,并使用新颖的估计器分析了初始相位信息如何影响增强过程。为了比较所有导出的估计量,从TIMIT数据库中提取了男性和女性说话者发出的语音信号。拟议的CUP估计器在目标性能测量,分段信噪比,相位信噪比,语音质量的感知评估,短时目标清晰度方面优于现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号