【24h】

A SUPERPOSED PROSODIC MODEL FOR CHINESE TEXT-TO-SPEECH SYNTHESIS

机译:汉语语篇合成的叠加韵律模型

获取原文
获取原文并翻译 | 示例

摘要

The paper presents the application of the trainable SFC superpositional prosodic model to Chinese. Within the SFC model, prosodic parameters (F0, syllabic lengthening) are interpreted as the superposition of overlapping multi-parametric contours. These contours are associated with high-level prosodic features operating at different scopes, such as tones, stress, prosodic boundary, part of speech of words, etc. Each feature label corresponds to a metalinguistic function (morphological, lexical, syntactic, attitudinal...) which is represented by a neural network. The observed contour is the sum of the outputs of the corresponding neural networks. An analysis-by-synthesis scheme is implemented for automatically learning. This model works well in the concatenation of neighbored units. The RMSE of F0 prediction is 2.34st (referenced to 200Hz), correlation is 0.86. Perceptual experiments show that the predicted prosody is quite appropriate and fluent.
机译:本文介绍了可训练的SFC叠加韵律模型在汉语中的应用。在SFC模型中,韵律参数(F0,音节加长)被解释为重叠的多参数轮廓的叠加。这些轮廓与在不同范围内运行的高级韵律特征(例如音调,重音,韵律边界,单词的词性等)相关联。每个特征标签都对应于元语言功能(形态,词法,句法,态度)。 。),由神经网络表示。观察到的轮廓是相应神经网络输出的总和。实施了一种综合分析方案,用于自动学习。该模型在相邻单元的串联中效果很好。 F0预测的RMSE为2.34st(参考200Hz),相关系数为0.86。感知实验表明,预测的韵律是相当适当且流利的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号