首页> 外文会议> >Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition
【24h】

Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition

机译:将Jacobian自适应与倒谱均值归一化和并行模型组合进行比较,以实现噪声鲁棒的语音识别

获取原文

摘要

In this paper, two techniques are researched for Jacobian adaptation (JA) in the presence of additive noise. Since the original concept of JA was presented only for static cepstral coefficients, the performance of JA is researched when it is extended to cover also the delta cepstrum. However, this extension or the original concept can not provide accurate recognition performance when the mismatch between the training and recognition environments is out of the linear range of JA. Hence, this problem can be alleviated to some extent by dividing JA into two steps. At first, the adaptation is done e.g. from clean to the target environment having "high" SNR level. After that, the new JA matrices are calculated and they are used in the second step to adapt the system to the lower target SNR level. Both of the above adaptation methods have been compared to cepstral mean normalization (CMN) and parallel model combination (PMC) in an isolated word recognition task having a vocabulary of 200 English words. The best performance was achieved with PMC but JA showed comparable performance to CMN and outperformed it when JA was done in two steps from SNR of 25 dB to 5 dB. The system was tested with the SpeechDat(II) database by adding noise recorded inside a car to the test set utterances at various SNR levels.
机译:在本文中,研究了在存在加性噪声的情况下针对雅可比适应(JA)的两种技术。由于JA的原始概念仅针对静态倒频谱系数提出,因此在扩展JA到同时涵盖三角倒频谱时,将对JA的性能进行研究。但是,当训练和识别环境之间的不匹配超出JA的线性范围时,此扩展或原始概念无法提供准确的识别性能。因此,通过将JA分为两个步骤,可以在某种程度上缓解此问题。首先,进行适应例如从干净到具有“高” SNR水平的目标环境。之后,将计算新的JA矩阵,并将其用于第二步,以使系统适应较低的目标SNR级别。在具有200个英语单词的词汇量的孤立单词识别任务中,已将上述两种自适应方法与倒谱平均归一化(CMN)和并行模型组合(PMC)进行了比较。 PMC可以达到最佳性能,但是JA表现出与CMN相当的性能,并且在从25 dB的SNR到5 dB的两个步骤中完成JA时,JA的性能均胜过其。通过在不同的SNR级别上将记录在车内的噪音添加到测试装置的发声中,使用SpeechDat(II)数据库对该系统进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号