Conference: Society for the Study of Artificial Intelligence and Simulation of Behaviour Convention: Communication, Interaction and Social Intelligence
Interplay between pragmatic and acoustic level to embody expressive cues in a Text to Speech system


Abstract

This paper addresses the problem of generating emotional speech within the unit selection approach to text-to-speech synthesis. Drawing on state-of-the-art research in fields ranging from psychology to linguistics, we claim that a complex interplay between the phonetic level and the pragmatic level of language constitutes the basis of the vocal expression of emotions, and that this phonetic-pragmatic interplay can be accounted for in a text-to-speech system by providing accurate representations of contextually relevant discourse markers. The availability of an inventory of expressive cues implementing discourse markers can improve the naturalness and expressivity of generated speech, moving toward the ambitious goal of emotional speech generation.