首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
【24h】

Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

机译:基于声学模型似然性的去混响参数优化的鲁棒语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.
机译:混响环境中的自动语音识别(ASR)是一项艰巨的任务。大多数去混响技术通过信号处理解决了这个问题,并增强了独立于语音识别器的混响波形。在本文中,我们提出了一种与后端ASR系统的可能性相关的执行去混响的新方案。我们提出的方法以多频带比例因子的形式有效地选择了去混响参数,从而提高了声学模型的可能性。然后,使用最佳参数重新训练声学模型。在识别阶段,我们将对参数进行其他优化。通过使用高斯混合模型(GMM),选择比例因子的过程变得高效。此外,通过使用人工RIR生成器并根据声学似然性进行选择,我们消除了所采用的去混响技术对房间脉冲响应(RIR)测量的依赖性。实验结果表明,与传统方法相比,该方法在识别性能上有显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号