Journal of Signal and Information Processing

An Intonation Speech Synthesis Model for Indonesian Using Pitch Pattern and Phrase Identification


Abstract

In text-to-speech (TTS) synthesis, prosody determines the tone, duration, and loudness of speech. Intonation is the part of prosody that determines tone. In Indonesian, intonation depends on sentence structure, sentence type, and the position of a word within the sentence. This study proposes a speech synthesis model that focuses on intonation. Intonation is derived from the sentence structure, the intonation patterns of example sentences, and the general rules of Indonesian pronunciation. The model takes text and intonation patterns as input. An initial prosody file is first generated from the general rules of Indonesian pronunciation. The sentence structure is then determined from the input text, which yields the intervals between the parts of the sentence (phrases). These intervals are used to correct the durations in the initial prosody file, and the frequencies in the prosody file are then corrected using the intonation patterns. The final result is a prosody file that a speech engine application can pronounce. Experiments comparing the original voice of a radio news announcer with the synthesized speech show that the F0 peaks are determined by whichever is dominant, the general rules or the intonation patterns. A similarity test using the PESQ method gives the synthesized speech a score of 1.18 on the MOS-LQO scale.
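The processing order described in the abstract (initial prosody, then duration correction at phrase boundaries, then frequency correction from an intonation pattern) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The `Segment` record, the function names, the default duration and pitch values, the pause length, and the use of linear interpolation to stretch the pattern are all assumptions made for illustration; the abstract does not specify the prosody file format or the correction formulas.

```python
from dataclasses import dataclass, replace
from typing import List, Set


@dataclass(frozen=True)
class Segment:
    """One entry of a hypothetical prosody file."""
    phoneme: str       # phoneme label; "_" marks a pause (assumed convention)
    duration_ms: int   # segment duration in milliseconds
    f0_hz: float       # pitch target in hertz


def initial_prosody(phonemes: List[str], default_ms: int = 80,
                    default_f0: float = 120.0) -> List[Segment]:
    """Stand-in for the 'general rules' step: flat duration and pitch."""
    return [Segment(p, default_ms, default_f0) for p in phonemes]


def correct_durations(segments: List[Segment], phrase_ends: Set[int],
                      pause_ms: int = 200) -> List[Segment]:
    """Insert a pause after each segment index that ends a phrase."""
    out: List[Segment] = []
    for i, seg in enumerate(segments):
        out.append(seg)
        if i in phrase_ends:
            out.append(Segment("_", pause_ms, seg.f0_hz))
    return out


def resample(pattern: List[float], n: int) -> List[float]:
    """Linearly interpolate a non-empty pattern to n points."""
    if n == 1:
        return [pattern[0]]
    out: List[float] = []
    for i in range(n):
        pos = i * (len(pattern) - 1) / (n - 1)
        lo = int(pos)
        hi = min(lo + 1, len(pattern) - 1)
        frac = pos - lo
        out.append(pattern[lo] * (1 - frac) + pattern[hi] * frac)
    return out


def correct_f0(segments: List[Segment],
               pattern_hz: List[float]) -> List[Segment]:
    """Overwrite pitch targets with the pattern stretched over all segments."""
    targets = resample(pattern_hz, len(segments))
    return [replace(s, f0_hz=f) for s, f in zip(segments, targets)]


if __name__ == "__main__":
    segs = initial_prosody(["s", "a", "y", "a"])
    timed = correct_durations(segs, phrase_ends={1})
    final = correct_f0(timed, [100.0, 200.0])
    for seg in final:
        print(seg.phoneme, seg.duration_ms, round(seg.f0_hz, 1))
```

In this sketch the phrase boundaries are passed in directly as segment indices; in the model described above they would come from the sentence-structure analysis of the input text.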
