首页> 外文会议>European conference on speech communication and technology >Rapid Vocal Tract Length Normalization using Maximum Likelihood Estimation
【24h】

Rapid Vocal Tract Length Normalization using Maximum Likelihood Estimation

机译:利用最大似然估计快速声乐道长度归一化

获取原文

摘要

Recently, vocal tract length normalization (VTLN) techniques have been developed for speaker normalization in speech recognition. This paper proposes a new VTLN method, in which the vocal tract length is normalized in the cepstrum space by means of linear mapping whose parameter is derived using maximum-likelihood estimation. The computational costs of this method are much lower than that of such conventional methods as ML-VTLN, in which the parameter for mapping is selected from among several parameters. Further, the new method offers greater precision in determining parameters for individual speakers. Experimental use of the method resulted in an error reduction rate of 7.1 %. A combination of the proposed method with cepstrum mean normalization (CMN) method was also examined and found to reduce the error rate even more, by 14.6%.
机译:最近,已经开发了语音识别中的扬声器归一化的声乐道长度标准化(VTLN)技术。本文提出了一种新的VTLN方法,其中通过线性映射通过线性映射在克斯特劳空间中归一化了声乐道长度,其参数使用最大似然估计来导出。该方法的计算成本远低于这种传统方法作为M1-VTLN的计算成本,其中从几个参数中选择用于映射的参数。此外,新方法在确定各个扬声器的参数时提供更精确的精度。该方法的实验使用导致误差降低率为7.1%。还研究了综合性归一化(CMN)方法的提出方法的组合,发现更低的误差率降低了14.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号