首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR
【24h】

Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR

机译:通过整合语音和噪声的频谱和位置特征来减少高度不稳定的环境噪声,以实现稳健的ASR

获取原文

摘要

This paper proposes a new multi-channel noise reduction approach that can appropriately handle highly nonstationary noise based on the spectral and locational features of speech and noise. We focus on a distant talking scenario, where a 2-ch microphone array receives a target speaker's voice from the front while it receives highly nonstationary ambient noise from any direction. To cope well with this scenario, we introduce prior training not only for the spectral features of speech and noise but also for their locational features, and utilize them in a unified manner. The proposed method can distinguish rapid changes in speech and noise based mainly on their locational features, while it can reliably estimate the spectral shapes of the speech based largely on the spectral features. A filter-bank based implementation is also discussed to enable the proposed method to work in real time. Experiments using the PASCAL CHiME separation and recognition challenge task show the superiority of the proposed method as regards both speech quality and automatic speech recognition performance.
机译:本文提出了一种新的多通道降噪方法,该方法可以根据语音和噪声的频谱和位置特征,适当地处理高度不稳定的噪声。我们关注的是一个遥远的谈话场景,其中一个2声道麦克风阵列从正面接收目标扬声器的声音,而从任何方向接收高度不稳定的环境噪声。为了很好地应对这种情况,我们不仅针对语音和噪声的频谱特征而且针对其位置特征引入了先验训练,并以统一的方式加以利用。所提出的方法可以主要基于它们的位置特征来区分语音和噪声的快速变化,而它可以主要基于频谱特征来可靠地估计语音的频谱形状。还讨论了基于滤波器组的实现,以使所提出的方法能够实时工作。使用PASCAL CHiME分离和识别挑战任务的实验表明,该方法在语音质量和自动语音识别性能方面均具有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号