首页> 外文期刊>Speech Communication >A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments
【24h】

A spatio-temporal speech enhancement scheme for robust speech recognition in noisy environments

机译:时空语音增强方案,用于嘈杂环境中的健壮语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

A new speech enhancement scheme is presented integrating spatial and temporal signal processing methods for robust speech recognition in noisy environments. The scheme first separates spatially localized point sources from noisy speech signals recorded by two microphones. Blind source separation algorithms assuming no a priori knowledge about the sources involved are applied in this spatial processing stage. Then denoising of distributed background noise is achieved in a combined spatial/temporal processing approach. The desired speaker signal is first processed along with an artificially constructed noise signal in a supplementary blind source separation step. It is further denoised by exploiting differences in temporal speech and noise statistics in a wavelet filterbank. The scheme's performance is illustrated by speech recognition experiments on real recordings in a noisy car environment. In comparison to a common multi-microphone technique like beamforming with spectral subtraction, the scheme is shown to enable more accurate speech recognition in the presence of a highly interfering point source and strong background noise.
机译:提出了一种新的语音增强方案,该方案集成了空间和时间信号处理方法,可在嘈杂的环境中进行鲁棒的语音识别。该方案首先从两个麦克风记录的嘈杂语音信号中分离出空间局部点源。假设没有有关源的先验知识的盲源分离算法将应用于此空间处理阶段。然后,以组合的空间/时间处理方法实现对分布式背景噪声的去噪。首先,在辅助盲源分离步骤中,将所需的扬声器信号与人工构建的噪声信号一起进行处理。通过利用小波滤波器组中的时间语音和噪声统计数据的差异来进一步消除噪声。该方案的性能通过在嘈杂的汽车环境中对真实录音进行语音识别实验来说明。与常见的多麦克风技术(例如带频谱减法的波束成形)相比,该方案在存在高度干扰的点源和强烈的背景噪声的情况下,能够实现更准确的语音识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号