首页> 外文会议>2011 IEEE International Conference on Acoustics, Speech and Signal Processing >Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation
【24h】

Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation

机译:用于声学模型自适应的基于MAP的映射函数估计的判别能力不断增强

获取原文

摘要

In this study, we propose increasing discriminative power on the maximum a posteriori (MAP)-based mapping function estimation for acoustic model adaptation. Based on the effective and stable learning advantages of MAP-based estimation, we incorporate a discriminative term and derive a new objective function. By applying the new function for online mapping function estimation, we developed discriminative maximum a posteriori (DMAP) linear regression (DMAPLR) and DMAP-based ensemble speaker and speaking environment modeling (DMAP-based ESSEM). We evaluate the DMAPLR and DMAP-based ESSEM on the Aurora-2 task in a supervised adaptation mode. The experimental results show that both DMAPLR and DMAP-based ESSEM consistently provide improvements over their ML-based and MAP-based counterparts irrespective of using one, two, or three adaptation utterances. From the improvements, we confirm the strong effect of increasing discriminative capability on the MAP-based mapping function estimation. Moreover, we verify that including multiple knowledge sources in the objective function can efficiently enhance model adaptation performance. When compared with the baseline result, DMAP-ESSEM achieves a 15.96% (9.21% to 7.74%) average word error rate (WER) reduction using only one adaptation utterance.
机译:在这项研究中,我们建议针对基于声学模型适应性的基于后验(MAP)的最大映射函数估计,提高判别能力。基于基于MAP的评估的有效和稳定的学习优势,我们引入了判别项,并得出了新的目标函数。通过将新功能应用到在线映射功能估计中,我们开发了判别最大后验(DMAP)线性回归(DMAPLR)和基于DMAP的整体演讲者和说话环境建模(基于DMAP的ESSEM)。我们在有监督的适应模式下对Aurora-2任务评估基于DMAPLR和基于DMAP的ESSEM。实验结果表明,无论使用一种,两种或三种适应话语,DMAPLR和基于DMAP的ESSEM都始终比其基于ML和基于MAP的同行提供改进。从这些改进中,我们确认了增加判别能力对基于MAP的映射函数估计的强大影响。此外,我们验证了在目标函数中包含多个知识源可以有效地提高模型自适应性能。与基线结果相比,DMAP-ESSEM仅使用一种适应性话语即可减少15.96%(9.21%至7.74%)的平均单词错误率(WER)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号