【24h】

STRUCTURAL KLD FOR CROSS-VARIETY SPEAKER ADAPTATION IN HMM-BASED SPEECH SYNTHESIS

机译:基于HMM的语音合成中跨物种扬声器自适应的结构KLD

获取原文
获取原文并翻译 | 示例

摘要

While the synthesis of natural sounding, neutral stylernspeech can be achieved using today’s technology, fast adaptationrnof speech synthesis to different contexts and situationsrnstill poses a challenge. In the context of varietyrnmodeling (dialects, sociolects) we have to cope with thernproblem that no standardized orthographic form is availablernand that existing speech resources for these varietiesrnare rare. We present recent approaches in the fieldrnof cross-lingual speaker transformation for HMM-basedrnspeech synthesis and propose a method for transforming anrnarbitrary speaker’s voice from one variety to another one.rnWe apply Kullback-Leibler divergence for data mapping ofrnHMM-states, transfer probability density functions to therndecision tree of the other variety and perform speaker adaptation.rnA method to integrate structural information in thernmapping is also presented and analyzed. Subjective listeningrntests show that the proposed method produces speechrnof significantly higher quality than standard speaker adaptationrntechniques.
机译:虽然可以使用当今的技术来实现自然发声,中性风格的语音合成,但快速的语音合成以适应不同的环境和情况仍然是一个挑战。在变体建模(方言,社会学)的背景下,我们必须应对以下问题:没有可用的标准化拼字形式,并且这些变体的现有语音资源很少。我们介绍了基于HMM的语音合成在fieldrnof跨语言说话人转换中的最新方法,并提出了一种将任意说话人的声音从一个变体转换为另一个变体的方法。还提出并分析了一种将结构信息整合到热成像中的方法。主观听觉测试表明,所提出的方法所产生的语音质量比标准的说话人适应技术要高得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号