首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Multistream Bandpass Modulation Features for Robust Speech Recognition
【24h】

Multistream Bandpass Modulation Features for Robust Speech Recognition

机译:多流带通调制功能可实现可靠的语音识别

获取原文

摘要

Current understanding of speech processing in the brain suggests dual streams of processing of temporal and spectral information, whereby slow vs. fast modulations are analyzed along parallel paths that encode various scales of information in speech signals. This unique way for the biology to analyze the multiplicity of information in speech signals along parallel paths can bare great lessons for feature extraction front-ends in speech processing systems, particularly for dealing with extrinsic degradations and unseen noise distortions. Here, we propose a multistream approach to feature analysis for robust speaker-independent phoneme recognition in presence of nonstationary background noises. The scheme presented here centers around a multi-path bandpass modulation analysis of speech sounds with each stream covering an entire range of temporal and spectral modulations. By performing bandpass operations of slow vs. fast information along the spectral and temporal dimensions, the proposed scheme avoids the classic feature explosion problem of previous multistream approaches while maintaining the advantage of parallelism and localized feature analysis. The proposed architecture results in substantial improvements over standard baseline features and two state-of-the-art noise robust feature schemes.
机译:当前对大脑中语音处理的理解提出了时间和频谱信息处理的双重流,从而沿编码语音信号中各种信息量的并行路径分析了慢速调制与快速调制。这种独特的生物学方法可以分析语音信号沿并行路径的多样性,这可以为语音处理系统中的特征提取前端(特别是处理外部退化和看不见的噪声失真)提供重要的经验教训。在这里,我们提出了一种多流方法来进行特征分析,以在存在非平稳背景噪声的情况下实现健壮的独立于说话人的音素识别。这里介绍的方案围绕语音的多路径带通调制分析,每个流覆盖整个时间和频谱调制范围。通过沿频谱和时间维度执行慢速信息与快速信息的带通操作,所提出的方案避免了先前多流方法的经典特征爆炸问题,同时保持了并行性和局部特征分析的优势。所提出的体系结构对标准基线特征和两个最新的噪声健壮特征方案进行了实质性的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号