首页> 外文会议>International Conference on speech and computer >Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming
【24h】

Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming

机译:基于频谱屏蔽和MVDR波束形成的非平稳噪声下低SNR的远场语音增强

获取原文

摘要

Low Signal to Noise Ratio (SNR) conditions are highly likely during remote speech acquisition. This paper handles a method of remote speech multi-channel signal processing for speech enhancement in presence of strong nonstationary noise. The presented approach builds upon the Minimum Variance Distortionless response (MVDR) method, additionally filtering the multi-channel signal prior to MVDR beamforming coefficient estimation with a spectral mask. This mask is obtained by applying mixture observation vector clustering based on a spatial correlation model, which is estimated by a Complex Gaussian Mixture Model (CGMM). The posterior probabilities obtained during the CGMM Expectation-Maximization (EM) algorithm are used to estimate the cumulative noise mask, which is applied to the mixture. The masked mixture is then used to calculate the MVDR covariance matrix and beamforming coefficients. The method is tested on four mixtures acquired using a 66 microphone array at various low SNR. The results are compared to conventional MVDR and several other methods and validated using the Signal to Distortion Ratio (SDR) improvement metric. The results show that the presented method gives SDR improvement no less than 1-1.5 dB in the majority of cases, compared to MVDR, and performs best specifically at low SNR of -15- -20 dB.
机译:在远程语音获取期间很可能出现低信噪比(SNR)的情况。本文介绍了一种在强烈的非平稳噪声存在下进行语音增强的远程语音多通道信号处理方法。提出的方法建立在最小方差无失真响应(MVDR)方法的基础上,在使用频谱模板进行MVDR波束成形系数估计之前,还对多通道信号进行了滤波。通过应用基于空间相关性模型的混合观测矢量聚类获得此蒙版,该空间相关性模型由复杂高斯混合模型(CGMM)估算。在CGMM期望最大化(EM)算法期间获得的后验概率用于估计应用于混合物的累积噪声蒙版。然后,将被掩盖的混合物用于计算MVDR协方差矩阵和波束形成系数。该方法在使用66个麦克风阵列以各种低SNR采集的四种混合物上进行了测试。将结果与常规MVDR和其他几种方法进行比较,并使用信噪比(SDR)改进指标进行了验证。结果表明,与MVDR相比,所提出的方法在大多数情况下可使SDR改善不少于1-1.5 dB,并且在-15至-20 dB的低SNR时表现最佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号