首页> 外文会议>International Conference on Spoken Language >N-best-based instantaneous speaker adaptation method for speech recognition
【24h】

N-best-based instantaneous speaker adaptation method for speech recognition

机译:基于N-BIST的语音识别瞬时扬声器适应方法

获取原文

摘要

An instantaneous speaker adaptation method is proposed that uses N-best decoding for continuous mixture-density hidden Markov model-based speech recognition systems. An N-best paradigm of multiple-pass search strategies is used that makes this method effective even for speakers whose decodings using speaker-independent models are error-prone. To cope with an insufficient amount of data, our method uses constrained maximum a posteriori estimation, in which the parameter vector space is clustered, and a mixture-mean bias is estimated for each cluster. Moreover, to maintain continuity between clusters, a bias for each mixture-mean is calculated as the weighted sum of the estimated biases. Performance evaluation using connected-digit (four-digit strings) recognition experiments performed over actual telephone lines showed more than a 20% reduction in the error rates, even for speakers whose decodings using speaker-independent models were error-prone.
机译:提出了一种瞬时扬声器适应方法,其利用基于连续混合密度隐马尔可夫模型的语音识别系统的n最佳解码。使用了多遍搜索策略的N-Best范式,使得这种方法即使对于使用扬声器的型号的解码的扬声器也有效。为了应对数据量不足,我们的方法使用约束的最大估计,其中参数矢量空间被聚集,并且估计每个簇的混合均衡。此外,为了保持簇之间的连续性,计算每个混合均值的偏差是计算为估计偏差的加权之和。使用连接数字(四位字符串)的性能评估在实际电话线上进行的识别实验显示出误差率的减少超过20%,即使对于使用扬声器无关的模型的解码的扬声器也是容易出错的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号