首页> 外文会议> >Bayesian adaptation of speech recognizers to field speech data
【24h】

Bayesian adaptation of speech recognizers to field speech data

机译:贝叶斯语音识别器对现场语音数据的适应

获取原文

摘要

The article studies a Bayesian (or Maximum A Posteriori MAP) approach to the adaptation of continuous density hidden Markov models (CDHMMs) to a specific condition of a speech recognition application. In order to improve the model robustness, CDHMMs formerly trained from laboratory data are then adapted using context dependent field utterances. Two specific problems have to be faced when using the MAP approach: the estimation of the a priori distribution parameters and the lack of field adaptation data for some distributions of the CDHMM. To estimate the a priori distribution parameters, we need to identify different realizations of the model parameters. Three different solutions are proposed and evaluated. To overcome the lack of adaptation data, field acoustical training frames may be shared among similar distributions. This is performed using an acoustical tree, obtained by progressively clustering the model distributions. Recognition results show that MAP adapted models significantly outperform those trained by maximum likelihood (ML), specifically when the field data set is small.
机译:本文研究了一种贝叶斯(或最大后验MAP)方法,以使连续密度隐藏马尔可夫模型(CDHMM)适应语音识别应用的特定条件。为了提高模型的鲁棒性,然后使用上下文相关字段话语对以前从实验室数据中训练的CDHMM进行调整。使用MAP方法时,必须面对两个具体问题:先验分布参数的估计以及CDHMM某些分布的场适应数据的缺乏。为了估计先验分布参数,我们需要确定模型参数的不同实现。提出并评估了三种不同的解决方案。为了克服缺乏适应性数据的问题,可以在类似的分布中共享现场声学训练帧。这是通过使用声学树来执行的,该声学树是通过逐步聚类模型分布而获得的。识别结果表明,适用于MAP的模型明显优于通过最大似然(ML)训练的模型,特别是在现场数据集较小时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号