【24h】

Noisy Speech Recognition Performance of Discriminative HMMs

机译:区分性HMM的嘈杂语音识别性能

获取原文
获取原文并翻译 | 示例

摘要

Discriminatively trained HMMs are investigated in both clean and noisy environments in this study. First, a recognition error is defined at different levels including string, word, phone and acoustics. A high resolution error measure in terms of minimum divergence (MD) is specifically proposed and investigated along with other error measures. Using two speaker-independent continuous digit databases, Aurora2(English) and CNDigits (Mandarin Chinese), the recognition performance of recognizers, which are trained in terms of different error measures and using different training modes, is evaluated under different noise and SNR conditions. Experimental results show that discriminatively trained models performed better than the maximum likelihood baseline systems. Specifically, for MD trained systems, relative error reductions of 17.62% and 18.52% were obtained applying multi-training on Aurora2 and CNDigits, respectively.
机译:在这项研究中,在干净和嘈杂的环境中对经过歧视性训练的HMM进行了研究。首先,识别错误定义在不同的级别,包括字符串,单词,电话和声学。特别提出了一种基于最小散度(MD)的高分辨率误差度量,并与其他误差度量一起进行了研究。使用两个独立于说话者的连续数字数据库Aurora2(英语)和CNDigits(普通话中文),在不同的噪声和SNR条件下,对根据不同的错误度量和不同的训练模式进行训练的识别器的识别性能进行评估。实验结果表明,经过判别训练的模型的性能优于最大似然基线系统。具体来说,对于经过MD训练的系统,通过在Aurora2和CNDigits上进行多次训练,相对误差减少了17.62%和18.52%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号