Annual Conference of the International Speech Communication Association (INTERSPEECH 2011)

Separating Speaker and Environmental Variability Using Factored Transforms

Abstract

Two primary sources of variability that degrade accuracy in speech recognition systems are the speaker and the environment. While many algorithms for speaker or environment adaptation have been proposed to improve performance, far less attention has been paid to approaches that address both factors. In this paper, we present a method for compensating for speaker and environmental mismatch using a cascade of constrained maximum likelihood linear regression (CMLLR) transforms. The proposed approach enables speaker transforms estimated in one environment to be effectively applied to speech from the same user in a different environment. The approach can be further improved using a new training method called speaker and environment adaptive training. When applying speaker transforms to new environments, the proposed approach results in a 13% relative improvement over conventional CMLLR.
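
As a rough illustration of what a cascade of CMLLR transforms means at the feature level, the sketch below treats each transform as an affine map x -> A x + b and composes an environment transform with a speaker transform. The composition order (environment first, then speaker), the 2-dimensional toy values, and all function and variable names are assumptions made for illustration, not details taken from the paper. Estimating the transforms from data is omitted entirely.

```python
# Toy sketch: CMLLR-style feature transforms modeled as affine maps x -> A x + b.
# All names and numbers are hypothetical; the composition order is an assumption.

def affine(A, b, x):
    """Apply the affine map A x + b to a feature vector x (plain lists)."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(A, b)]

def compose(A2, b2, A1, b1):
    """Return (A, b) such that A x + b == A2 (A1 x + b1) + b2."""
    inner = len(A1)          # rows of A1 == columns of A2
    cols = len(A1[0])
    A = [[sum(A2[i][k] * A1[k][j] for k in range(inner)) for j in range(cols)]
         for i in range(len(A2))]
    b = [sum(A2[i][k] * b1[k] for k in range(inner)) + b2[i]
         for i in range(len(A2))]
    return A, b

# Hypothetical environment and speaker transforms (not estimated from data).
A_env, b_env = [[1.0, 0.5], [0.0, 2.0]], [1.0, -1.0]
A_spk, b_spk = [[2.0, 0.0], [1.0, 1.0]], [0.0, 3.0]

x = [1.0, 2.0]  # toy feature vector

# Cascade: apply the environment transform, then the speaker transform.
cascaded = affine(A_spk, b_spk, affine(A_env, b_env, x))

# The cascade collapses to a single affine transform.
A_c, b_c = compose(A_spk, b_spk, A_env, b_env)
combined = affine(A_c, b_c, x)
```

Because the speaker factor is kept as a separate pair (A_spk, b_spk) in this sketch, it could in principle be paired with a different environment transform, which mirrors the reuse scenario the abstract describes.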
