Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model

Abstract

This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step is state-level phone alignment and conversion of phone durations into numbers of frames. In the second step, a hidden Markov model (HMM) is trained in which the observation is the number of frames in each state and the hidden state is the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. The hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluations on a held-out test set showed results comparable to those of a baseline HSMM-based speech synthesis system. This duration modelling approach is computationally simpler than the HSMM and produces comparable results in terms of the quality of synthetic speech.
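
The two steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a 5 ms frame shift (not stated in the abstract), treats the phone labels as known from alignment so that each phone's emission distribution can be estimated directly, models the frame count of each state with an independent Gaussian, and leaves out transition probabilities and any iterative HMM training. The names `seconds_to_frames`, `fit_emissions`, and `generate_durations` are hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev

FRAME_SHIFT_S = 0.005  # assumed 5 ms frame shift; not given in the abstract


def seconds_to_frames(duration_s, frame_shift_s=FRAME_SHIFT_S):
    """Step 1: convert a duration in seconds to a whole number of frames (>= 1)."""
    return max(1, round(duration_s / frame_shift_s))


def fit_emissions(alignments):
    """Step 2 (emission side only): per phone, fit one Gaussian per state.

    `alignments` is an iterable of (phone, [state durations in seconds]) pairs,
    standing in for the output of a state-level alignment. Every occurrence of
    a phone is assumed to have the same number of states.
    """
    per_phone = defaultdict(list)
    for phone, state_durations_s in alignments:
        per_phone[phone].append([seconds_to_frames(d) for d in state_durations_s])

    emissions = {}
    for phone, rows in per_phone.items():
        columns = list(zip(*rows))  # one column of frame counts per state
        emissions[phone] = [(mean(c), pstdev(c)) for c in columns]
    return emissions


def generate_durations(phones, emissions):
    """Generate per-state frame counts as the rounded Gaussian means."""
    return [[max(1, round(mu)) for mu, _ in emissions[p]] for p in phones]
```

Calling `fit_emissions` on aligned training data and then `generate_durations` on a target phone sequence yields one list of per-state frame counts per phone. Using the Gaussian mean as the generated value is a simplifying choice for this sketch; the paper's actual generation procedure may differ.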

Bibliographic details

Source
2012 11th International Conference on Information Science, Signal Processing and their Applications | 2012 | pp. 700-705 | 6 pages
Conference location: Montreal (CA)
Authors
Ogbureke Udochukwu; Cabral Joao; Berndsen Julie;
Author affiliation

CNGL, School of Computer Science and Informatics, University College Dublin, Belfield, Dublin 4, Ireland;

Conference organizer
Original format: PDF
Language of text: eng
Chinese Library Classification: Communication theory
Keywords

Similar literature

1. Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities [J]. Park Y.K., Un C.K., Kwon O.W. IEEE Transactions on Speech and Audio Processing. 1996, No. 5
2. Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities [J]. Park Y.K., Un C.K. IEEE Transactions on Speech and Audio Processing. 1996, No. 5
3. Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system [J]. Khan Najeeb Ullah, Jung-Chul Lee. Electronics Letters. 2015, No. 12
4. Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model [C]. Ogbureke Udochukwu, Cabral Joao, Berndsen Julie. International Conference on Information Science, Signal Processing and Their Applications. 2012
5. Hidden Markov models for visual speech synthesis in limited data environments [D]. Arb, Harold Allan. 2001
6. Explicit-Duration Hidden Markov Model Inference of UP-DOWN States from Continuous Signals [O]. James M. McFarland, Thomas T. G. Hahn, Mayank R. Mehta. 2008
7. Explicit-Duration Hidden Markov Model Inference of UP-DOWN States from Continuous Signals [O]. McFarland, James M., Hahn, Thomas T. G., Mehta, Mayank R. 2011
8. Explicit Modelling of State Duration Correlations in Hidden Markov Models [R]. Russell, M. J., Sime, L. 1988
