【24h】

Towards improving ASR robustness for PSN GSM telephone applications

机译:致力于提高PSN和GSM电话应用的ASR鲁棒性

获取原文

摘要

In real-life applications, the speech recognition system errors are mainly due to inadequate detection of speech segments, unreliable rejection of out-of-vocabulary (OOV) words, and noise and transmission channel effects. In this paper, we present the results of several experiments carried out on field vs. laboratory databases and on databases collected over PSN and GSM networks. The main sources of errors are analyzed. Preprocessing techniques as well as HMM adaptation techniques are used to increase the robustness to mismatches between training and testing conditions. We show that a blind equalization scheme improves significantly the recognition accuracy on both field and GSM data. Bayesian adaptation of hidden Markov models (HMM) parameters produces robust models to field conditions. The obtained results prove that HMM adaptation and preprocessing techniques can be advantageously combined, in order to improve ASR robustness.
机译:在实际应用中,语音识别系统错误主要是由于语音段检测不足,拒绝语音(OOV)单词不可靠以及噪声和传输通道效应引起的。在本文中,我们介绍了在现场数据库与实验室数据库以及通过PSN和GSM网络收集的数据库上进行的几次实验的结果。分析了错误的主要来源。预处理技术以及HMM自适应技术可用于提高训练条件和测试条件之间不匹配的鲁棒性。我们表明,盲目均衡方案显着提高了现场和GSM数据的识别精度。隐马尔可夫模型(HMM)参数的贝叶斯自适应产生了针对现场条件的鲁棒模型。获得的结果证明,HMM自适应和预处理技术可以有利地组合在一起,以提高ASR的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号