首页> 外文会议>2012 11th International Conference on Information Science, Signal Processing and their Applications. >Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model
【24h】

Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model

机译:使用连续隐藏马尔可夫模型的基于HMM的语音合成中的显式持续时间建模

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step in this process is state level phone alignment and conversion of phone durations into the number of frames. In the second step, a hidden Markov model (HMM) is trained whereby the observation is the number of frames in each state and the hidden state the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. Hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluation on a held-out test set showed comparable results with a baseline HSMM-based speech synthesis. This duration modelling approach is computationally simpler than HSMM and produces comparable results in terms of the quality of synthetic speech.
机译:本文提出了一种新的方法,用于基于HMM的语音合成的显式持续时间建模。所提出的方法是一个两步过程。此过程的第一步是状态级别的电话对齐,并将电话持续时间转换为帧数。在第二步中,训练了隐马尔可夫模型(HMM),其中观察值是每种状态下的帧数以及电话的隐形状态。最后,每个状态的持续时间(帧数)是从受过训练的HMM中生成的。隐藏的半马尔可夫模型(HSMM)是基于HMM的语音合成中显式持续时间建模的基准。坚持测试集上的客观评估和感性评估均显示出与基于HSMM基线语音合成的结果相当的结果。这种持续时间建模方法在计算上比HSMM更简单,并且在合成语音的质量方面可产生可比的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号