首页> 外文会议>International Conference on Statistical Language and Speech Processing >A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation
【24h】

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

机译:关于组合GMM和DNN框架对扬声器适应的新视角

获取原文

摘要

In this paper we investigate the GMM-derived features for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models with the focus on exploration of fusion of the adapted GMM-derived features and the conventional bottleneck features. We analyze and compare different types of fusion, such as feature level, posterior level, lattice level and others in order to discover the best possible way of fusion. Experimental results on the TED-LIUM corpus show that the proposed adaptation technique can be effectively integrated into DNN setup at different levels and provide additional gain in recognition performance: up to 6% of relative word error rate reduction (WERR) over the strong speaker adapted DNN baseline, and up to 22% of relative WERR in comparison with a speaker independent DNN baseline model, trained on conventional features.
机译:在本文中,我们研究了GMM导出的特征,用于改编上下文的深度神经网络嗯(CD-DNN-HMM)声学模型,重点是探索适用的GMM导出的特征和传统瓶颈特征的融合。我们分析和比较不同类型的融合,如特征级,后水平,晶格水平等,以发现最佳的融合方式。 TED-Lium语料库上的实验结果表明,所提出的适应技术可以在不同级别中有效地集成到DNN设置中,并提供识别性能的额外增益:在强大的扬声器上,高达6%的相对字错误率减少(WERR)适应与扬声器独立的DNN基线模型相比,DNN基线,相对WER的高达22%,接受常规特征培训。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号