首页> 外文期刊>Signal processing >Context-adaptive pre-processing scheme for robust speech recognition in fast-varying noise environment
【24h】

Context-adaptive pre-processing scheme for robust speech recognition in fast-varying noise environment

机译:时变噪声环境下用于语音识别的上下文自适应预处理方案

获取原文
获取原文并翻译 | 示例

摘要

Based on the observation that dissimilar speech enhancement algorithms perform differently for different types of interference and noise conditions, we propose a context-adaptive speech pre-processing scheme, which performs adaptive selection of the most advantageous speech enhancement algorithm for each condition. The selection process is based on an unsupervised clustering of the acoustic feature space and a subsequent mapping function that identifies the most appropriate speech enhancement channel for each audio input, corresponding to unknown environmental conditions. Experiments performed on the MoveOn motorcycle speech and noise database validate the practical value of the proposed scheme for speech enhancement and demonstrate a significant improvement in terms of speech recognition accuracy, when compared to the one of the best performing individual speech enhancement algorithm. This is expressed as accuracy gain of 3.3% in terms of word recognition rate. The advance offered in the present work reaches beyond the specifics of the present application, and can be beneficial to spoken interfaces operating in fast-varying noise environments.
机译:基于不同的语音增强算法在不同类型的干扰和噪声条件下执行效果不同的观察,我们提出了一种上下文自适应语音预处理方案,该方案针对每种条件对最有利的语音增强算法进行自适应选择。选择过程基于声学特征空间的无监督聚类和随后的映射功能,该映射功能为每个音频输入标识了最合适的语音增强通道,对应于未知的环境条件。与性能最佳的单个语音增强算法之一相比,在MoveOn摩托车语音和噪声数据库上进行的实验验证了所提出的语音增强方案的实用价值,并证明了语音识别准确性方面的显着提高。这表示为基于单词识别率的3.3%的准确度增益。当前工作中提供的进步超出了本申请的范围,并且对于在快速变化的噪声环境中运行的口语界面可能是有益的。

著录项

  • 来源
    《Signal processing》 |2011年第8期|p.2101-2111|共11页
  • 作者单位

    Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500 Rion-Patras, Greece;

    Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500 Rion-Patras, Greece;

    Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500 Rion-Patras, Greece;

    Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500 Rion-Patras, Greece;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    fast-varying noise; speech enhancement; speech pre-processing; speech recognition; motorcycle environment;

    机译:时变噪声;语音增强;语音预处理;语音识别;摩托车环境;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号