Journal: IEEE Transactions on Audio, Speech, and Language Processing

The IBM expressive text-to-speech synthesis system for American English

Abstract

Expressive text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions which use TTS. We describe a TTS engine which can be directed, via text markup, to use a variety of expressive styles (here: questioning, contrastive emphasis, and conveying good and bad news). Differences in these styles lead us to investigate two approaches for expressive TTS, a "corpus-driven" and a "prosodic-phonology" approach. Each speaker records 11 h (excluding silences) of "neutral" sentences. In the corpus-driven approach, the speaker also records 1-h corpora in each expressive style; these segments are tagged by style for use during search, and decision trees for determining f0 contours and timing are trained separately for each of the neutral and expressive corpora. In the prosodic-phonology approach, rules translating certain expressive markup elements to tones and break indices (ToBI) are manually determined, and the ToBI elements are used in single f0 and duration trees for all expressions. Tests show that listeners correctly identify the intended style of the synthesis at rates ranging from 70% for "conveying bad news" to 85% for "yes-no questions". Further improvements are demonstrated through the use of speaker-pooled f0 and duration models.