首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems
【24h】

Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems

机译:大词汇表语音识别系统中扬声器适应的扬声器聚类和转换

获取原文

摘要

A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test speaker, and using only the data from these speakers (rather than the complete training corpus) to re-estimate the system parameters. Further, a linear transformation is computed for every one of the selected training speakers to better map the training speaker's data to the test speaker's acoustic space. Finally, the system parameters (Gaussian means) are re-estimated specifically for the test speaker using the transformed data from the selected training speakers. Experiments showed that this scheme is capable of reducing the error rate by 10-15% with the use of as little as 3 sentences of adaptation data.
机译:描述了一种扬声器适应策略,其基于查找讲话者的子集,来自训练集,他们在声学靠近测试扬声器,并仅使用来自这些扬声器(而不是完整的培训语料库)来重新估计的数据系统参数。此外,为每个选定的训练扬声器计算线性变换,以更好地将训练扬声器的数据映射到测试扬声器的声学空间。最后,使用来自所选训练扬声器的变换数据专门针对测试扬声器重新估计系统参数(高斯手段)。实验表明,该方案能够将误差率降低10-15%,使用只需的适应数据的3句。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号