首页> 外文会议>International Conference on Text, Speech and Dialogue(TSD 2005); 20050912-15; Karlovy Vary(CZ) >Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech
【24h】

Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech

机译:捷克语大词汇量连续语音识别中有监督和无监督说话人适应

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper deals with the problem of efficient speaker adaptation in large vocabulary continuous speech recognition (LVCSR) systems. The main goal is to adapt acoustic models of speech and to increase the recognition accuracy of these systems in tasks, where only one user is expected (e.g. voice dictation) or where the speaking person can be identified automatically (e.g. broadcast news transcription). For this purpose, we propose several modifications of the well known MLLR (Maximum Likelihood Linear Regression) method and we combine them with the MAP (Maximum A Posteriori) method. The results from a series of experiments show that the error rate of our 300K-word Czech recogniser can be reduced by about 9.9 % when only 30 seconds of supervised data are used for adaptation or by about 9.6 % when unsupervised adaptation on the same data is performed.
机译:本文探讨了大型词汇连续语音识别(LVCSR)系统中有效的说话人适应问题。主要目标是适应语音的声学模型并提高这些系统在任务中的预期任务中的识别准确性,这些任务只需要一个用户(例如语音听写),或者可以自动识别讲话者(例如广播新闻转录)。为此,我们提出了对众所周知的MLLR(最大似然线性回归)方法的几种修改,并将它们与MAP(最大后验线性)方法结合在一起。一系列实验的结果表明,当仅使用30秒的有监督数据进行自适应时,我们的30万字捷克识别器的错误率可降低9.9%;如果对相同数据进行无监督适应,则可将错误率降低约9.6%。执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号