Conference: International Conference on Speech and Computer

Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming

Abstract

Low signal-to-noise ratio (SNR) conditions are highly likely during remote speech acquisition. This paper presents a multi-channel signal processing method for enhancing remotely acquired speech in the presence of strong nonstationary noise. The approach builds upon the Minimum Variance Distortionless Response (MVDR) method, additionally filtering the multi-channel signal with a spectral mask prior to MVDR beamforming coefficient estimation. The mask is obtained by clustering the mixture observation vectors based on a spatial correlation model, which is estimated by a Complex Gaussian Mixture Model (CGMM). The posterior probabilities obtained during the CGMM Expectation-Maximization (EM) algorithm are used to estimate the cumulative noise mask, which is applied to the mixture. The masked mixture is then used to calculate the MVDR covariance matrix and beamforming coefficients. The method is tested on four mixtures acquired with a 66-microphone array at various low SNRs. The results are compared with conventional MVDR and several other methods and validated using the Signal to Distortion Ratio (SDR) improvement metric. The presented method gives an SDR improvement of no less than 1-1.5 dB over MVDR in the majority of cases, and performs best specifically at low SNRs of -15 to -20 dB.
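
The pipeline in the abstract (apply a noise mask to the mixture, estimate a covariance matrix from the masked mixture, then compute MVDR coefficients) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the noise mask is taken as given (the CGMM EM step is not shown), the steering vector `d` is a hypothetical input whose estimation the abstract does not describe, the diagonal loading term is added here only for numerical stability, and all function names are illustrative.

```python
import numpy as np

def masked_noise_covariance(X, noise_mask, eps=1e-8):
    """Mask-weighted spatial covariance per frequency bin.

    X          : (F, T, M) complex STFT of the mixture (assumed layout)
    noise_mask : (F, T) real values in [0, 1], assumed to come from a
                 separate mask estimator such as CGMM posteriors
    returns    : (F, M, M) Hermitian covariance matrices
    """
    Xm = X * noise_mask[..., None]                  # apply the mask to the mixture
    R = np.einsum('ftm,ftn->fmn', Xm, Xm.conj())    # sum over frames of x x^H
    norm = np.maximum((noise_mask ** 2).sum(axis=1), eps)
    return R / norm[:, None, None]

def mvdr_weights(R, d, diag_load=1e-6):
    """MVDR weights w = R^{-1} d / (d^H R^{-1} d) for each frequency bin.

    R : (F, M, M) covariance matrices
    d : (F, M) steering vectors (hypothetical input, assumed known)
    """
    M = R.shape[-1]
    # Diagonal loading (an assumption of this sketch) keeps R invertible.
    scale = np.trace(R, axis1=1, axis2=2).real / M
    R_loaded = R + (diag_load * scale + 1e-12)[:, None, None] * np.eye(M)
    Rinv_d = np.linalg.solve(R_loaded, d[..., None])[..., 0]
    denom = np.einsum('fm,fm->f', d.conj(), Rinv_d)
    return Rinv_d / denom[:, None]

def beamform(X, w):
    """Apply the beamformer: y(f, t) = w(f)^H x(f, t)."""
    return np.einsum('fm,ftm->ft', w.conj(), X)
```

By construction the weights satisfy the distortionless constraint w^H d = 1 in every frequency bin, so a signal arriving along the assumed steering direction passes unchanged while the power contributed by the masked-mixture covariance is minimized.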