首页> 外文期刊>Journal of computer sciences >Analysis of Decision Trees in Context Clustering of Hidden Markov Model Based Thai Speech Synthesis | Science Publications
【24h】

Analysis of Decision Trees in Context Clustering of Hidden Markov Model Based Thai Speech Synthesis | Science Publications

机译:基于隐马尔可夫模型的上下文聚类决策树分析泰式语音合成|科学出版物

获取原文
       

摘要

> Problem statement: In Thai speech synthesis using Hidden Markov model (HMM) based synthesis system, the tonal speech quality is degraded due to tone distortion. This major problem must be treated appropriately to preserve the tone characteristics of each syllable unit. Since tone brings about the intelligibility of the synthesized speech. It is needed to establish the tone questions and other phonetic questions in tree-based context clustering process accordingly. Approach: This study describes the analysis of questions in tree-based context clustering process of an HMM-based speech synthesis system for Thai language. In the system, spectrum, pitch or F0 and state duration are modeled simultaneously in a unified framework of HMM, their parameter distributions are clustered independently by using a decision-tree based context clustering technique. The contextual factors which affect spectrum, pitch and duration, i.e., part of speech, position and number of phones in a syllable, position and number of syllables in a word, position and number of words in a sentence, phone type and tone type, are taken into account for constructing the questions of the decision tree. All in all, thirteen sets of questions are analyzed in comparison. Results: In the experiment, we analyzed the decision trees by counting the number of questions in each node coming from those thirteen sets and by calculating the dominance score given to each question as the reciprocal of the distance from the root node to the question node. The highest number and dominance score are of the set of phonetic type, while the second, third highest ones are of the set of part of speech and tone type. Conclusion: By counting the number of questions in each node and calculating the dominance score, we can set the priority of each question set. All in all, the analysis results bring about further development of Thai speech synthesis with efficient context clustering process in an HMM-based speech synthesis system.
机译: > 问题陈述:在泰语语音合成中使用隐线性马尔可夫模型(HMM)的合成系统,由于色调失真,色调语音质量降低。必须适当地对待这一主要问题以保留每个音节单元的音调特征。由于音调带来了合成语音的可懂度。需要在相应的基于树的上下文聚类过程中建立音调问题和其他语音问题。 方法:本研究描述了泰语语言综合组合系统的基于树的上下文聚类过程中的问题分析。在系统中,频谱,俯仰或F0和状态持续时间在HMM的统一框架中同时建模,通过使用基于决策树的上下文聚类技术独立地聚集它们的参数分布。影响频谱,音高和持续时间的上下文因素,即音节,位置和音节中的音节中的一部分的语音,位置和电话数量,句子,电话类型和音调类型中的单词的位置和数量,被考虑在构建决策树的问题。总而言之,相比之下分析了十三个问题。 结果:在实验中,我们通过计算来自那些十三个集中的每个节点的问题数量,并通过计算每个问题的主导评分作为距离的距离的距离来分析决策树根节点到问题节点。最高的数量和优势分数是该组的语音类型,而第二个第三高的最多是语音和音调类型的一组。 结论:通过计算每个节点中的问题数量并计算主导评分,我们可以设置每个问题集的优先级。总而言之,分析结果为泰国语音组分过程中的高效上下文聚类过程引进了泰语语音合成的进一步发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号