首页> 外文会议> >Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems
【24h】

Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems

机译:大词汇量语音识别系统中说话人的聚类和转换,以适应说话人

获取原文

摘要

A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test speaker, and using only the data from these speakers (rather than the complete training corpus) to re-estimate the system parameters. Further, a linear transformation is computed for every one of the selected training speakers to better map the training speaker's data to the test speaker's acoustic space. Finally, the system parameters (Gaussian means) are re-estimated specifically for the test speaker using the transformed data from the selected training speakers. Experiments showed that this scheme is capable of reducing the error rate by 10-15% with the use of as little as 3 sentences of adaptation data.
机译:描述了说话人适应策略,该策略基于从训练集中找到听觉上离测试说话人较近的说话人子集,并仅使用来自这些说话人的数据(而不是完整的训练语料库)来重新估计系统参数。此外,为每个选定的训练说话者计算线性变换,以更好地将训练说话者的数据映射到测试说话者的声学空间。最后,使用来自所选训练说话者的变换后的数据,专门针对测试说话者重新估计系统参数(高斯均值)。实验表明,该方案只需使用3个句子的自适应数据就能将错误率降低10-15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号