首页> 外文期刊>Electronics Letters >Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system
【24h】

Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system

机译:基于隐马尔可夫模型的文本语音合成系统中的最佳状态持续时间分配

获取原文
获取原文并翻译 | 示例
           

摘要

In state-of-the-art text-to-speech (TTS) systems the state durations for each phoneme are generated so as to maximise the state sequence probability given the constraint that the sum of all state durations should be equal to the phoneme duration. Such maximisation sometimes results in negative state durations when the specified phoneme duration is less than the sum of the means of all the states of the phoneme. Such discrepancy implicitly results in the violation of the equality constraint. This has implications for speech research problems, in which each phoneme duration is specified. One such problem is the use of the TTS synthesis system for singing voice synthesis research. An algorithm for state duration assignment is derived so as to maximise the probability of the state sequence with the constraints that the sum of state durations should be equal to the total duration of the phoneme and all the state durations must be greater than or equal to 1. Experimental results show that the proposed algorithm always produces state durations greater than or equal to 1 while satisfying the equality constraint.
机译:在最新的文本语音转换(TTS)系统中,在所有状态持续时间之和应等于音素持续时间的约束条件下,生成每个音素的状态持续时间,以便最大化状态序列概率。当指定的音素持续时间小于音素所有状态的均值之和时,这种最大化有时会导致负状态持续时间。这种差异隐含地导致违反平等约束。这对语音研究问题有影响,其中指定了每个音素持续时间。这样的问题之一是使用TTS合成系统进行歌声合成研究。导出状态持续时间分配算法,以最大化状态序列的概率,并具有以下约束条件:状态持续时间之和应等于音素的总持续时间,并且所有状态持续时间必须大于或等于1实验结果表明,该算法在满足等式约束的同时,总是产生大于或等于1的状态持续时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号