首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis
【24h】

A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis

机译:基于HMM的歌声合成的跨流依赖性建模的完整训练框架

获取原文

摘要

A cross-stream dependence modelling (CSDM) method has been proposed to model the dependence of spectral distributions on F0 observations for hidden Markov model (HMM) based speech synthesis. However, this method incorporates CSDM only for the embedded training of HMM estimation while ignoring CSDM in the clustering of context-dependent HMMs. This paper applies CSDM to HMM-based singing voice synthesis and presents a decision-tree-based model clustering method with explicit CSDM. This method, in conjunction with the previous CSDM method, forms a full CSDM training framework. Experimental results demonstrate that this full CSDM training framework achieves better performance than the previous CSDM method and the baseline without CSDM in a singing voice synthesis task.
机译:为了基于隐马尔可夫模型(HMM)进行语音合成,提出了一种跨流依赖性建模(CSDM)方法来建模频谱分布对F0观测值的依赖性。但是,此方法仅将CSDM用于HMM估计的嵌入式训练,而在上下文相关HMM的聚类中忽略CSDM。本文将CSDM应用于基于HMM的歌声合成中,提出了一种采用显式CSDM的基于决策树的模型聚类方法。该方法与先前的CSDM方法结合在一起,形成了完整的CSDM培训框架。实验结果表明,完整的CSDM训练框架在歌唱语音合成任务中比以前的CSDM方法和没有CSDM的基线具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号