首页> 外文会议>INTERSPEECH 2012 >Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs
【24h】

Low-SNR, Speaker-Dependent Speech Enhancement using GMMs and MFCCs

机译:低SNR,扬声器依赖性语音增强,使用GMMS和MFCCS

获取原文

摘要

In this paper, we propose a two-stage speech enhancement technique. In the training stage, a Gaussian Mixture Model (GMM) of the mel-frequency cepstral coefficients (MFCCs) of a user's clean speech is computed wherein the component densities of the GMM serve to model the user's "acoustic classes." In the enhancement stage, MFCCs from a noisy speech signal are computed and the underlying clean acoustic class is identified via a maximum a posteriori (MAP) decision and a novel mapping matrix. The associated GMM parameters are then used to estimate the MFCCs of the clean speech from the MFCCs of the noisy speech. Finally, the estimated MFCCs are transformed back to a time-domain waveform. Our results show that we can improve PESQ in environments as low as-10 dB SNR.
机译:在本文中,我们提出了一种两级语音增强技术。在训练阶段,计算用户清洁语音的麦克朗谱系数(MFCC)的高斯混合模型(GMM),其中GMM的组件密度用于建模用户的“声学类”。在增强阶段,计算来自噪声语音信号的MFCC,并且通过最大后验(MAP)决策和新颖的映射矩阵来识别底层清洁声学类。然后,关联的GMM参数用于估计来自嘈杂语音的MFCC的清洁语音的MFCC。最后,估计的MFCC被转换回时域波形。我们的结果表明,我们可以在低至10 dB SNR的环境中改进PESQ。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号