首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis
【24h】

Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

机译:强大的基于说话人自适应HMM的文本到语音合成

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a speaker-adaptive HMM-based speech synthesis system. The new system, called “HTS-2007,” employs speaker adaptation (CSMAPLR+MAP), feature-space adaptive training, mixed-gender modeling, and full-covariance modeling using CSMAPLR transforms, in addition to several other techniques that have proved effective in our previous systems. Subjective evaluation results show that the new system generates significantly better quality synthetic speech than speaker-dependent approaches with realistic amounts of speech data, and that it bears comparison with speaker-dependent approaches even when large amounts of speech data are available. In addition, a comparison study with several speech synthesis techniques shows the new system is very robust: It is able to build voices from less-than-ideal speech data and synthesize good-quality speech even for out-of-domain sentences.
机译:本文介绍了一种基于说话者自适应的基于HMM的语音合成系统。新系统称为“ HTS-2007”,除了已证明有效的其他几种技术之外,还采用了说话人自适应(CSMAPLR + MAP),特征空间自适应训练,混合性别建模和使用CSMAPLR变换的全协方差建模。在我们以前的系统中。主观评估结果表明,与具有真实语音数据量的基于说话者的方法相比,新系统产生的合成语音质量要好得多,并且即使有大量语音数据,它也可以与基于说话者的方法进行比较。此外,与几种语音合成技术的比较研究表明,该新系统非常健壮:它能够从不理想的语音数据中构建语音,甚至可以针对域外句子合成高质量的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号