首页> 外文会议>2011 IEEE Workshop on Automatic Speech Recognition amp; Understanding >Speaker adaptation based on speaker-dependent eigenphone estimation
【24h】

Speaker adaptation based on speaker-dependent eigenphone estimation

机译:基于说话人相关本征电话估计的说话人自适应

获取原文
获取原文并翻译 | 示例

摘要

Based on speaker dependent eigenphone estimation, a novel speaker adaptation technique is proposed in this paper. Different from conventional speaker adaptation approaches, the proposed method explicitly models the phone variations for each speaker through subspace modeling in the phone space. The phone coordinate, which is shared by all speakers, contains correlation information between different phones. During speaker adaptation, two schemes for estimation of the new speaker specific phone variation bases (namely eigenphones) are derived under maximum likelihood (ML) criterion and maximum a posteriori (MAP) criterion respectively. Supervised speaker adaptation experiments on a Mandarin Chinese continuous speech recognition task show that the new method outperforms both eigenvoice and maximum likelihood linear regression (MLLR) methods when sufficient adaptation data is available.
机译:基于说话人本征电话估计,提出了一种新颖的说话人自适应技术。与传统的说话人适应方法不同,所提出的方法通过电话空间中的子空间建模为每个说话人显式地建模电话变化。所有扬声器共享的电话坐标包含不同电话之间的关联信息。在说话者自适应期间,分别在最大似然(ML)准则和最大后验(MAP)准则下推导出两种用于估计新的特定于说话者的电话变化基础(即本征电话)的方案。在普通话连续语音识别任务上的有监督的说话人适应性实验表明,在有足够的适应性数据时,新方法的性能优于特征语音和最大似然线性回归(MLLR)方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号