首页> 外文会议>IEEE/ACIS International Conference on Computer and Information Science >Auditory mask estimation by RPCA for monaural speech enhancement
【24h】

Auditory mask estimation by RPCA for monaural speech enhancement

机译:RPCA的听觉掩膜估计用于单声道语音增强

获取原文

摘要

Mask estimation has shown a lot of promise in speech enhancement for its simplicity and large speech intelligibility improvement. In this paper, the gammachirp filter banks are applied on the contaminated speech signal to get the auditory time-frequency representation. Robust principal component analysis with non-negative constraint is employed to decompose the auditory time-frequency representation into sparse and low-rank components using alternating direction method of multipliers optimization algorithm. Auditory Mask is estimated by these two parts which are correspond to the speech and noise. Consider that binary mask produces separated sources with more distortion than soft mask estimation. Auditory mask estimation is based on the ideal ratio mask estimation. Experimental results show that the proposed method could achieve better performance in terms of PESQ and LSD compared with multiband spectral subtraction and Robust principal component analysis methods.
机译:掩码估计由于其简单性和语音清晰度的提高而在语音增强方面显示出了广阔的前景。本文将gammachirp滤波器组应用于受污染的语音信号,以获得听觉时频表示。采用非负约束的鲁棒主成分分析方法,采用乘数优化算法的交替方向法将听觉时频表示分解为稀疏和低秩成分。听觉遮罩是由与语音和噪声相对应的这两个部分估计的。考虑到二进制掩码产生的分离源比软掩码估计的失真更大。听觉掩膜估计基于理想比率掩膜估计。实验结果表明,与多频带谱减法和鲁棒主成分分析方法相比,该方法在PESQ和LSD方面可以获得更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号