首页> 外国专利> Speaker and environment adaptation based on eigenvoices including maximum likelihood method

Speaker and environment adaptation based on eigenvoices including maximum likelihood method

机译:基于特征语音的说话人和环境适应,包括最大似然法

摘要

A set of speaker dependent models is trained upon a comparatively large number of training speakers, one model per speaker, and model parameters are extracted in a predefined order to construct a set of supervectors, one per speaker. Principle component analysis is then performed on the set of supervectors to generate a set of eigenvectors that define an eigenvoice space. If desired, the number of vectors may be reduced to achieve data compression. Thereafter, a new speaker provides adaptation data from which a supervector is constructed by constraining this supervector to be in the eigenvoice space based on a maximum likelihood estimation. The resulting coefficients in the eigenspace of this new speaker may then be used to construct a new set of model parameters from which an adapted model is constructed for that speaker. Environmental adaptation may be performed by including environmental variations in the training data.
机译:在相对大量的训练说话者上训练一组说话者相关模型,每个说话者一个模型,并且以预定顺序提取模型参数以构造一组超向量,每个说话者一个。然后,对这组超向量执行主成分分析,以生成定义特征语音空间的一组特征向量。如果需要,可以减少向量的数量以实现数据压缩。此后,新的说话者提供适应数据,通过基于最大似然估计将超向量约束在本征语音空间中,从该自适应数据构造超向量。然后,可以使用该新说话者的本征空间中的所得系数来构建新的模型参数集,从中为该说话者构建适应模型。可以通过在训练数据中包括环境变化来执行环境适应。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号