Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

机译：情感语音到语音翻译系统的多种语言情感语音识别与合成

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech-to-speech translation (S2ST) is the process by which a spoken utterance in one language is used to produce a spoken output in another language. The conventional approach to S2ST has focused on processing linguistic information only by directly translating the spoken utterance from the source language to the target language without taking into account paralinguistic and non-linguistic information such as the emotional states at play in the source language. In this work, we explore how to deal with Para-and non-linguistic information among multiple languages, with a particular focus on speakers' emotional states, in S2ST scenarios called "affective S2ST." In our efforts to construct an effective system, we discuss (1) how to describe emotions in speech and how to model the perception/production of emotions and (2) the commonality and differences among multiple languages in the proposed model. We then use these discussions as context for (3) an examination of our "affective S2ST" system in operation.

机译：语音到语音翻译（S2ST）是一种过程，在该过程中，使用一种语言的语音来生成另一种语言的语音输出。 S2ST的常规方法集中于仅通过直接将语音从源语言转换为目标语言来处理语言信息，而不考虑诸如语言中正在玩的情感状态之类的语言和非语言信息。在这项工作中，我们探索如何在称为“情感S2ST”的S2ST场景中处理多种语言之间的对位和非语言信息，特别关注说话者的情绪状态。在构建有效系统的过程中，我们讨论（1）如何描述语音中的情感以及如何对情感的感知/产生进行建模，以及（2）所提出的模型中多种语言之间的共性和差异。然后，我们将这些讨论用作上下文，作为（3）对正在运行的“情感S2ST”系统的检查。

著录项

来源
《International Conference on Intelligent Information Hiding and Multimedia Signal Processing》|2014年|574-577|共4页
会议地点
作者
Akagi Masato; Xiao Han; Elbarougy Reda; Hamada Yasuhiro; Junfeng Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
emotion recognition; language translation; natural language processing; speech recognition; speech synthesis; affective S2ST system; affective speech-to-speech translation system; emotion perception-production; emotional speech recognition; emotional speech synthesis; nonlinguistic information; para-linguistic information; Acoustics; Databases; Emotion recognition; Production; Semantics; Speech; Speech recognition; Speech-to-speech translation (S2ST) system; emotion recognition/synthesis; multiple languages; paralinguistic and non-linguistic information;

机译：情感识别;语言翻译;自然语言处理;语音识别;语音合成;情感S2ST系统;情感言语翻译系统;情感感知产生;情感语音识别;情感语音合成;非语言信息;准语言信息;声学;数据库;情感识别;生产;语义学;语音;语音识别;语音到语音翻译（S2ST）系统;情感识别/合成;多种语言;多语言和非语言信息;

相似文献

外文文献
中文文献
专利

1. The Janus-III Translation System: Speech-to-Speech Translation in Multiple Domains [J] . LORI LEVIN, ALON LAVIE, MONIKA WOSZCZYNA Machine Translation . 2000,第1a2期

机译：Janus-III翻译系统：多领域的语音翻译
2. ANUVAADHAK: A Two-way, Indian Language Speech-to-Speech Translation System for Local Travel Information Assistance [J] . VENKATA VINAY BABU VEMULA, PRADEEP KUMAR NARNE, MRUDULA KUDARAVALLI, International Journal of Engineering Science and Technology . 2010,第8期

机译：ANUVAADHAK：一种双向的印度语音到语音翻译系统，用于本地旅行信息协助
3. Affect-insensitive speaker recognition systems via emotional speech clustering using prosodic features [J] . Li Dongdong, Yuan Yubo, Wu Zhaohui, Neural computing & applications . 2015,第2期

机译：使用韵律特征的情感语音聚类，对情感不敏感的说话人识别系统
4. Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System [C] . Akagi Masato, Xiao Han, Elbarougy Reda, International Conference on Intelligent Information Hiding and Multimedia Signal Processing . 2014

机译：朝向情感演讲翻译系统的多种语言中的情感语音识别与综合
5. THE IDENTIFICATION OF EMOTIONAL MEANINGS IN ELECTRONICALLY FILTERED SPEECH: A STUDY OF RIGHT BRAIN INJURED AND NORMAL CHILDREN (RIGHT HEMISPHERE, AFFECT RECOGNITION). [D] . TALLMAN, JOHN HARLAND. 1984

机译：电子过滤语音中的情感含义的识别：对右脑受伤和正常儿童（右半球，情感识别）的研究。
6. Individual differences in language and working memory affect children’s speech recognition in noise [O] . Ryan W. McCreery, Meredith Spratford, Benjamin Kirby, -1

机译：语言和工作记忆的个体差异会影响儿童语音中的语音识别
7. Janus-III: Speech-To-Speech Translation In Multiple Languages [O] . Alon Lavie, Alex Waibel, Lori Levin, 1997

机译：Janus-III：多种语言的语音翻译
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System

摘要

著录项

相似文献

相关主题

期刊订阅