首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Speaker clustering and transformation for speaker adaptation in speech recognition systems
【24h】

Speaker clustering and transformation for speaker adaptation in speech recognition systems

机译:语音识别系统中的说话人适应和说话人聚类和转换

获取原文
获取原文并翻译 | 示例
           

摘要

A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test speaker, and using only the data from these speakers (rather than the complete training corpus) to reestimate the system parameters. Further, a linear transformation is computed for every one of the selected training speakers to better map the training speaker's data to the test speaker's acoustic space. Finally, the system parameters (Gaussian means) are reestimated specifically for the test speaker using the transformed data from the selected training speakers. Experiments showed that this scheme is capable of providing an 18% relative improvement in the error rate on a large-vocabulary task with the use of as little as three sentences of adaptation data.
机译:描述了说话人适应策略,该策略基于从训练集中找到在听觉上靠近测试说话人的说话人子集,并仅使用来自这些说话人的数据(而不是完整的训练语料库)来重新估计系统参数。此外,为每个选定的训练说话者计算线性变换,以更好地将训练说话者的数据映射到测试说话者的声学空间。最后,使用来自所选训练说话者的转换数据,专门针对测试说话者重新估计系统参数(高斯均值)。实验表明,该方案通过使用少至三个句子的自适应数据,就可以将大型词汇任务的错误率相对提高18%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号