首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Evaluation of linear regression for speaker adaptation in HMM-based articulatory movements estimation
【24h】

Evaluation of linear regression for speaker adaptation in HMM-based articulatory movements estimation

机译:基于HMM的关节运动估计中说话人适应性的线性回归评估

获取原文

摘要

Acoustic-to-articulatory inversion problem is usually studied in speaker-specific manner because both articulatory data and acoustic features contain speaker-specific components. This paper presents our work on speaker-adaptation training for this problem. We implement speaker adaptation in HMM-based acoustic-to-articulatory inversion mapping, and evaluate different combinatorial structures of the articulatory data and acoustic features. The HMM-based inversion mapping models are built with single-stream and multistream, independent clustering and shared clustering structures. The speaker adaptation is implemented in stream-independent structure and shared adaptation structure. The constrained maximum likelihood linear regression method is used for the speaker-adaptive transformation. The experimental results show that the sharing of the speaker-adaptive transformation of the articulatory feature stream and acoustic feature stream can improve the estimation accuracy in inversion mapping. The multi-stream system with shared clustering and shared adaptive transformation has the best result among all the tested structures.
机译:语音到发音的反转问题通常以特定于说话者的方式进行研究,因为发音数据和声学特征都包含特定于说话者的成分。本文介绍了我们针对此问题的说话人适应训练的工作。我们在基于HMM的声学到发音反演映射中实现说话人自适应,并评估发音数据和声学特征的不同组合结构。基于HMM的反演映射模型是使用单流和多流,独立的聚类和共享的聚类结构构建的。说话者自适应以与流无关的结构和共享自适应结构来实现。约束最大似然线性回归方法用于说话人自适应转换。实验结果表明,发音特征流和声学特征流的说话人自适应变换的共享可以提高反演映射的估计精度。在所有测试结构中,具有共享聚类和共享自适应变换的多流系统具有最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号