Highlights'/> Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments
首页> 外文期刊>Computer speech and language >Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments
【24h】

Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments

机译:适用于嘈杂环境中语音识别的Rank-1约束多通道维纳滤波器

获取原文
获取原文并翻译 | 示例
           

摘要

HighlightsA constant residual noise power constraint for rank-1 MWF is proposed.Speech covariance matrix reconstruction to fulfill the rank-1 assumption.An extensive comparison of the multichannel linear filters supported by BLSTM.A feature variance metric that correlates with the word error rate.AbstractMultichannel linear filters, such as the Multichannel Wiener Filter (MWF) and the Generalized Eigenvalue (GEV) beamformer are popular signal processing techniques which can improve speech recognition performance. In this paper, we present an experimental study on these linear filters in a specific speech recognition task, namely the CHiME-4 challenge, which features real recordings in multiple noisy environments. Specifically, the rank-1 MWF is employed for noise reduction and a new constant residual noise power constraint is derived which enhances the recognition performance. To fulfill the underlying rank-1 assumption, the speech covariance matrix is reconstructed based on eigenvectors or generalized eigenvectors. Then the rank-1 constrained MWF is evaluated with alternative multichannel linear filters under the same framework, which involves a Bidirectional Long Short-Term Memory (BLSTM) network for mask estimation. The proposed filter outperforms alternative ones, leading to a 40% relative Word Error Rate (WER) reduction compared with the baseline Weighted Delay and Sum (WDAS) beamformer on the real test set, and a 15% relative WER reduction compared with the GEV-BAN method. The results also suggest that the speech recognition accuracy correlates more with the Mel-frequency cepstral coefficients (MFCC) feature variance than with the noise reduction or the speech distortion level.
机译: 突出显示 < ce:list-item id = “ celistitem0001 ”> 恒定的残余噪声功率约束建议用于等级1 MWF。 语音协方差矩阵重构,以满足1级假设。 广泛比较了BLSTM支持的多通道线性滤波器。 与字错误率相关的特征差异度量。 摘要

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号