【24h】

Improvements of a dual-input DBN for noise robust ASR

机译:双输入DBN的改进以增强抗噪ASR

获取原文

摘要

In previous work we have shown that an ASR system consisting of a dual-input Dynamic Bayesian Network (DBN) which simultaneously observes MFCC acoustic features and an exemplar-based Sparse Classification (SC) phoneme predictor stream can achieve better word recognition accuracies in noise than a system that observes only one input stream. This paper explores three modifications of SC input to further improve the noise robustness of the dual-input DBN system: 1) using state likelihoods instead of phonemes, 2) integrating more contextual information and 3) using a complete set of likelihood distribution. Experiments on aurora-2 reveal that the combination of the first two approaches significantly improves the recognition results, achieving up to 29% (absolute) accuracy gain at SNR -5 dB. In the dual-input system using the full likelihood vector does not outperform using the best state prediction.
机译:在先前的工作中,我们表明,由双输入动态贝叶斯网络(DBN)同时观察MFCC声学特征和基于示例的稀疏分类(SC)音素预测器流组成的ASR系统,在噪声方面比在单词识别方面的准确性要高。一种仅观察一个输入流的系统。本文探讨了SC输入的三种修改,以进一步提高双输入DBN系统的噪声鲁棒性:1)使用状态似然代替音素,2)集成更多上下文信息,以及3)使用完整的似然分布集。在aurora-2上进行的实验表明,前两种方法的组合可显着改善识别结果,在SNR -5 dB时可获得高达29%(绝对)的准确度增益。在使用全似然向量的双输入系统中,使用最佳状态预测不会胜过任何情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号