首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >HYBRID ACOUSTIC MODELS FOR DISTANT AND MULTICHANNEL LARGE VOCABULARY SPEECH RECOGNITION
【24h】

HYBRID ACOUSTIC MODELS FOR DISTANT AND MULTICHANNEL LARGE VOCABULARY SPEECH RECOGNITION

机译:遥远和多通道大词汇语音识别的混合声学模型

获取原文

摘要

We investigate the application of deep neural network (DNN)-hidden Markov model (HMM) hybrid acoustic models for far-field speech recognition of meetings recorded using microphone arrays. We show that the hybrid models achieve significantly better accuracy than conventional systems based on Gaussian mixture models (GMMs). We observe up to 8% absolute word error rate (WER) reduction from a discrimina-tively trained GMM baseline when using a single distant microphone, and between 4-6% absolute WER reduction when using beamforming on various combinations of array channels. By training the networks on audio from multiple channels, we find the networks can recover significant part of accuracy difference between the single distant microphone and beamformed configurations. Finally, we show that the accuracy of a network recognising speech from a single distant microphone can approach that of a multi-microphone setup by training with data from other microphones.
机译:我们研究了使用麦克风阵列记录的麦克风识别的远场语音识别的深神经网络(DNN)混合声学模型的应用。我们表明,混合模型比基于高斯混合模型(GMMS)的传统系统实现了明显的准确性。当使用单个远距离麦克风时,我们观察到最多8%的绝对字错误率(WER)减少,以及在阵列通道的各种组合上使用波束形成时,在4-6%的绝对WER减小之间。通过从多个通道培训音频上的网络,我们发现网络可以在单个远程麦克风和波束成形配置之间恢复精度差异的重要部分。最后,我们表明,通过使用来自其他麦克风的数据训练,网络识别来自单个远处麦克风的语音的准确性可以接近多麦克风设置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号