首页> 外文会议>European conference on speech communication and technology >A Posteriori and a Priori Transformations for Speaker Adaptation in Large Vocabulary Speech Recognition Systems
【24h】

A Posteriori and a Priori Transformations for Speaker Adaptation in Large Vocabulary Speech Recognition Systems

机译:大型词汇语音识别系统中扬声器适应的后验和先验变换

获取原文

摘要

The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corresponding speaker-independent recognizers. The aim of speaker adaptation techniques is to enhance the speaker-independent acoustic models to bring their recognition accuracy as close as possible to the one obtained with speaker-dependent models. In this paper, we propose a method using test and training data for acoustic model adaptation. This method operates in two steps. The first one performs an a priori adaptation using the transcribed training data of the closest training speakers to the test speaker. This adaptation is done with MAP procedure allowing reduced variances in the acoustic models. The second one performs an a posteriori adaptation using the MLLR procedure on the test data, allowing mapping of Gaussians means to match the test speaker's acoustic space. This adaptation strategy was evaluated in a large vocabulary speech recognition task. Our method leads to a relative gain of 15% with respect to the baseline system and 10% with respect to the conventional MLLR adaptation.
机译:与相应的扬声器 - 独立识别器相比,基于扬声器的基于HMM的识别器提供了较低的错误率。扬声器适配技术的目的是增强扬声器 - 独立的声学模型,以使其识别精度尽可能靠近用扬声器依赖模型获得。在本文中,我们提出了一种使用测试和培训数据进行声学模型适应的方法。该方法以两个步骤操作。第一个使用最近训练扬声器的转录训练数据来执行先验的适应性。使用MAP过程完成这种适配,允许声学模型中的差异降低。第二个使用测试数据上的MLLR过程执行后验,允许高斯映射意味着匹配测试扬声器的声学空间。这种适应策略在大型词汇表识别任务中进行了评估。我们的方法相对于基线系统的相对增益为15%,相对于传统的MLL适应而导致10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号