首页> 外文会议>Annual Conference of the International Speech Communication Association >Pause prediction from text for speech synthesis with user-definable pause insertion likelihood threshold
【24h】

Pause prediction from text for speech synthesis with user-definable pause insertion likelihood threshold

机译:通过用户可定义的暂停插入似然阈值,从文本扫描的暂停预测

获取原文

摘要

Predicting the location of pauses from text is an important aspect for speech synthesizers. The accuracy of pause prediction can significantly influence both naturalness and intelligibility. Pauses which help listeners to better parse the synthesized speech into meaningful units are deemed to increase naturalness and intelligibility ratings, while pauses in unexpected or incorrect locations can reduce these ratings and cause confusion. This paper presents a multi-stage pause prediction approach including first prosodic chunk prediction, followed by a feature scoring algorithm and finally a pause sequence evaluation module. Preference tests showed that the new method outperformed a pauses-at-punctuation baseline while not yet matching human performance. In addition, the approach includes two more functionalities: (1) a user-specifiable pause insertion rate and (2) multiple output formats in the form of binary pauses, multi-level pauses or as a score reflecting pause strength.
机译:预测文本暂停的位置是语音合成器的一个重要方面。暂停预测的准确性可以显着影响自然和可懂度。暂停,帮助听众更好地解析为有意义的单位的合成演讲是增加自然和可懂度评级,而意外或不正确的位置的暂停可以减少这些评级并引起混乱。本文介绍了一种多级暂停预测方法,包括第一韵律块预测,其次是特征评分算法,最后是暂停序列评估模块。偏好测试表明,新方法表现出暂停的暂停标点基线,而尚未匹配人类性能。此外,该方法包括两个功能:(1)用户可指定的暂停插入速率和(2)二进制暂停,多级暂停或作为反映暂停强度的分数的多个输出格式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号