首页> 外文会议>Asilomar Conference on Signals, Systems and Computers >Time-frequency correlation based missing-feature reconstruction for robust speech recognition in background noise conditions
【24h】

Time-frequency correlation based missing-feature reconstruction for robust speech recognition in background noise conditions

机译:基于时频相关的缺失特征重建在背景噪声条件下的鲁棒语音识别

获取原文

摘要

This study proposes a novel missing-feature reconstruction method to improve speech recognition in background noise environments. In order to improve the existing missing-feature reconstruction method which utilizes only frequency correlation, a temporal spectral feature analysis is employed to leverage temporal correlation across neighboring frames. The final estimates for missing-feature reconstruction are obtained by a selective combination of the frequency correlation based method and the proposed temporal correlation based method. Performance of the proposed method is evaluated using the Aurora 2.0 framework with car noise and speech babble conditions. Experimental results demonstrate that the proposed method is more effective at increasing speech recognition performance in adverse conditions. By employing the proposed temporal-frequency based reconstruction method with SNR-based mask estimation, +21.31% and +20.73% average relative improvements in WER are obtained for car and speech babble conditions, compared to the original frequency correlation based method.
机译:这项研究提出了一种新颖的缺失特征重建方法,以改善背景噪声环境下的语音识别。为了改进现有的仅利用频率相关性的缺失特征重建方法,采用时间频谱特征分析来利用跨相邻帧的时间相关性。通过基于频率相关的方法和建议的基于时间相关的方法的选择性组合,可以获得缺失特征重构的最终估计值。使用Aurora 2.0框架在汽车噪音和语音ba语条件下评估了所提出方法的性能。实验结果表明,该方法在不利条件下能有效提高语音识别性能。与原始的基于频率相关性的方法相比,通过采用所提出的基于时频的重建方法和基于SNR的掩码估计,在汽车和语音ba语条件下,WER的平均相对改善率分别为+ 21.31%和+ 20.73%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号