首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Histogram-based subband powerwarping and spectral averaging for robust speech recognition under matched and multistyle training
【24h】

Histogram-based subband powerwarping and spectral averaging for robust speech recognition under matched and multistyle training

机译:基于直方图的子带功率变形和频谱平均,可在匹配和多样式训练下实现鲁棒的语音识别

获取原文

摘要

This paper describes a new algorithm that increases the robustness of speech recognition systems by matching the power histograms of the input in each frequency band to those obtained over clean training data, and then mixing together the processed and unprocessed spectra. Before calculating prototype histograms over the training data, the power signals in each channel are normalized by the local maximum and minimum of the channel. In contrast, histograms calculated over the testing data are normalized by the global maximum and minimum of the power spectrum. This mode of normalization leads to a significant reduction in noise. Following the histogram-based processing, it is shown that taking a weighted average between the processed and unprocessed power spectra contributes to further gains in recognition accuracy. Results are obtained for multiple speech recognition systems, noise types, and training conditions illustrating the broad utility of this approach.
机译:本文介绍了一种新算法,该算法通过将每个频带中输入的功率直方图与通过干净训练数据获得的功率直方图进行匹配,然后将已处理和未处理的频谱混合在一起,从而提高语音识别系统的鲁棒性。在根据训练数据计算原型直方图之前,通过通道的局部最大值和最小值对每个通道中的功率信号进行归一化。相反,根据测试数据计算出的直方图通过功率谱的全局最大值和最小值进行归一化。这种归一化模式可显着降低噪声。在基于直方图的处理之后,显示出在已处理和未处理的功率谱之间进行加权平均有助于进一步提高识别精度。针对多种语音识别系统,噪声类型和训练条件获得了结果,说明了该方法的广泛用途。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号