首页> 外文期刊>IEEE transactions on audio, speech and language processing >Robust speech recognition in noisy environments based on subband spectral centroid histograms
【24h】

Robust speech recognition in noisy environments based on subband spectral centroid histograms

机译:基于子带频谱质心直方图的嘈杂环境中的鲁棒语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

We investigate how dominant-frequency information can be used in speech feature extraction to increase the robustness of automatic speech recognition against additive background noise. First, we review several earlier proposed auditory-based feature extraction methods and argue that the use of dominant-frequency information might be one of the major reasons for their improved noise robustness. Furthermore, we propose a new feature extraction method, which combines subband power information with dominant subband frequency information in a simple and computationally efficient way. The proposed features are shown to be considerably more robust against additive background noise than standard mel-frequency cepstrum coefficients on two different recognition tasks. The performance improvement increased as we moved from a small-vocabulary isolated-word task to a medium-vocabulary continuous-speech task, where the proposed features also outperformed a computationally expensive auditory-based method. The greatest improvement was obtained for noise types characterized by a relatively flat spectral density.
机译:我们研究如何将主导频率信息用于语音特征提取,以提高针对附加背景噪声的自动语音识别的鲁棒性。首先,我们回顾了几种较早提出的基于听觉的特征提取方法,并认为使用主导频率信息可能是其改善噪声鲁棒性的主要原因之一。此外,我们提出了一种新的特征提取方法,该方法以简单且计算有效的方式将子带功率信息与主要子带频率信息相结合。在两个不同的识别任务上,与标准的mel-频率倒谱系数相比,拟议的功能在抵抗加性背景噪声方面表现出更强大的性能。当我们从小词汇孤立词任务转移到中等词汇连续语音任务时,性能提高有所提高,其中建议的功能也胜过基于计算的听觉方法。对于以相对平坦的频谱密度为特征的噪声类型,获得了最大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号