【24h】

The Pause Duration Prediction for Mandarin Text-to-Speech System

机译:汉语普通话语音转换系统的暂停时间预测

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we enter into detailed analysis on how the pause duration under different prosodic boundaries are affected by various contextual factors in natural speech. To get the correlation between them, the paper calculates the mean pause duration under different prosodic boundaries. The contextual factors investigated in this paper contains both linguistic features, such as boundary types, syllable tones of boundary sides, initial and final types etc, and acoustic features, such as pitch gap across the boundary. The paper makes experiments and discussion which reveals the influence of these factors on pause duration. Based on that, the paper creates a pause duration prediction model for mandarin speech synthesis system. The model was proved to be able to generate high quality prosody output with the listening test.
机译:在本文中,我们将详细分析自然韵律中各种情境因素对不同韵律边界下的停顿持续时间的影响。为了获得它们之间的相关性,本文计算了不同韵律边界下的平均停顿持续时间。本文研究的语境因素既包括语言特征(例如边界类型),边界边的音节音调,初始和最终类型等,也包括声学特征(例如边界上的音高差距)。本文进行了实验和讨论,揭示了这些因素对暂停持续时间的影响。在此基础上,建立了普通话语音合成系统的停顿持续时间预测模型。该模型经听力测试证明能够产生高质量的韵律输出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号