...
首页> 外文期刊>IEICE Transactions on Information and Systems >Hmm-based Style Control For Expressive Speech Synthesis With Arbitrary Speaker's Voice Using Model Adaptation
【24h】

Hmm-based Style Control For Expressive Speech Synthesis With Arbitrary Speaker's Voice Using Model Adaptation

机译:基于模型自适应的基于Hmm的风格控制,用于任意讲话者语音的表达性语音合成

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents methods for controlling the intensity of emotional expressions and speaking styles of an arbitrary speaker's synthetic speech by using a small amount of his/her speech data in HMM-based speech synthesis. Model adaptation approaches are introduced into the style control technique based on the multiple-regression hidden semi-Markov model (MRHSMM). Two different approaches are proposed for training a target speaker's MRHSMMs. The first one is MRHSMM-based model adaptation in which the pretrained MRHSMM is adapted to the target speaker's model. For this purpose, we formulate the MLLR adaptation algorithm for the MRHSMM. The second method utilizes simultaneous adaptation of speaker and style from an average voice model to obtain the target speaker's style-dependent HSMMs which are used for the initialization of the MRHSMM. From the result of subjective evaluation using adaptation data of 50 sentences of each style, we show that the proposed methods outperform the conventional speaker-dependent model training when using the same size of speech data of the target speaker.
机译:本文提出了在基于HMM的语音合成中使用少量语音数据来控制任意说话者的合成语音的情绪表达和说话风格的强度的方法。在多回归隐藏半马尔可夫模型(MRHSMM)的基础上,将模型自适应方法引入样式控制技术中。提出了两种不同的方法来训练目标说话人的MRHSMM。第一个是基于MRHSMM的模型适配,其中,将预训练的MRHSMM适配到目标说话者的模型。为此,我们为MRHSMM制定了MLLR自适应算法。第二种方法利用说话人和说话人风格的平均语音模型同时适应,以获得目标说话人的风格相关HSMM,用于MRHSMM的初始化。从使用每种风格的50个句子的适应数据进行主观评估的结果中,我们表明,当使用目标说话者相同大小的语音数据时,所提出的方法优于传统的依赖说话者的模型训练。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号