首页> 外文会议>International Conference on Intelligent Information Hiding and Multimedia Signal Processing >Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System
【24h】

Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

机译:情感语音到语音翻译系统的多种语言情感语音识别与合成

获取原文

摘要

Speech-to-speech translation (S2ST) is the process by which a spoken utterance in one language is used to produce a spoken output in another language. The conventional approach to S2ST has focused on processing linguistic information only by directly translating the spoken utterance from the source language to the target language without taking into account paralinguistic and non-linguistic information such as the emotional states at play in the source language. In this work, we explore how to deal with Para-and non-linguistic information among multiple languages, with a particular focus on speakers' emotional states, in S2ST scenarios called "affective S2ST." In our efforts to construct an effective system, we discuss (1) how to describe emotions in speech and how to model the perception/production of emotions and (2) the commonality and differences among multiple languages in the proposed model. We then use these discussions as context for (3) an examination of our "affective S2ST" system in operation.
机译:语音到语音翻译(S2ST)是一种过程,在该过程中,使用一种语言的语音来生成另一种语言的语音输出。 S2ST的常规方法集中于仅通过直接将语音从源语言转换为目标语言来处理语言信息,而不考虑诸如语言中正在玩的情感状态之类的语言和非语言信息。在这项工作中,我们探索如何在称为“情感S2ST”的S2ST场景中处理多种语言之间的对位和非语言信息,特别关注说话者的情绪状态。在构建有效系统的过程中,我们讨论(1)如何描述语音中的情感以及如何对情感的感知/产生进行建模,以及(2)所提出的模型中多种语言之间的共性和差异。然后,我们将这些讨论用作上下文,作为(3)对正在运行的“情感S2ST”系统的检查。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号