Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis

Jaime Lorenzo-Trueba; Gustav Eje Henter; Shinji Takaki; Junichi Yamagishi

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis

【24h】

Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis

机译：在基于RNN的情感语音合成中的监督者感知注释在内的影响分析

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper investigates simultaneous modeling of multiple emotions in DNN-based expressive speech synthesis, and how to represent the emotional labels, such as emotional class and strength, for this task. Our goal is to answer two questions: First, what is the best way to annotate speech data with multiple emotions? Second, how should the emotional information be represented as labels for supervised DNN training? We evaluate on a large-scale corpus of emotional speech from a professional actress, additionally annotated with perceived emotional labels from crowd-sourced listeners. By comparing DNN-based speech synthesizers that utilize different emotional representations, we assess the impact of these representations and design decisions on human emotion recognition rates.

机译：本文调查了基于DNN的表达语音合成中多种情绪的同时建模，以及如何代表这项任务的情绪标签，如情绪阶级和力量。我们的目标是回答两个问题：首先，用多种情绪注释语音数据的最佳方式是什么？其次，情绪信息应该如何表示为监督DNN培训的标签？我们评估了从专业女演员的大规模情绪讲话中，另外用来自人群兴趣的听众的感知情绪标签诠释。通过比较利用不同情绪表现的DNN的语音合成器，我们评估这些陈述和设计决策对人类情感识别率的影响。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2017年第368期|共2页
作者
Jaime Lorenzo-Trueba; Gustav Eje Henter; Shinji Takaki; Junichi Yamagishi;
展开▼
作者单位

National Institute of Informatics;

National Institute of Informatics;

National Institute of Informatics;

National Institute of Informatics;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电报、传真;
关键词
Emotional speech synthesis; Recurrent neural networks; Speech perception;

机译：情绪语音合成;复发性神经网络;语音感知;

相似文献

外文文献
中文文献
专利

1. Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis [J] . Jaime Lorenzo-Trueba, Gustav Eje Henter, Shinji Takaki, 電子情報通信学会技術研究報告. 音声. Speech . 2017,第368期

机译：在基于RNN的情感语音合成中的监督者感知注释在内的影响分析
2. The Role of "Active Listening" in Informal Helping Conversations: Impact on Perceptions of Listener Helpfulness, Sensitivity, and Supportiveness and Discloser Emotional Improvement [J] . Graham D. Bodie, Andrea J. Vickery, Kaitlin Cannava, Western Journal of Communication . 2015,第2期

机译：“主动倾听”在非正式帮助性对话中的作用：对听众的帮助，敏感性和支持感以及公开情绪改善的感知的影响
3. Perception of Speech Produced by Native and Nonnative Talkers by Listeners with Normal Hearing and Listeners with Cochlear Implants [J] . Ji Caili, Galvin John J., Chang Yi-ping, Journal of speech, language, and hearing research: JSLHR . 2014,第2期

机译：正常听觉的听众和人工耳蜗的听众对母语者和非母语者讲话的感知
4. Impact of speaker variability on speech perception in non-native listeners [C] . Wim A. van Dommelen, Valerie Hazan Annual conference of the International Speech Communication Association;INTERSPEECH 2011 . 2011

机译：说话人差异对非母语听众语音感知的影响
5. Machine-learning Solutions for Emotional Speech: Exploiting the Information of Individual Annotations [D] . Lotfian, Reza. 2018

机译：机器学习解决方案，用于情绪言论：利用个人注释的信息
6. Normal-Hearing Listeners’ and Cochlear Implant Users’ Perception of Pitch Cues in Emotional Speech [O] . Steven Gilbers, Christina Fuller, Dicky Gilbers, 2015

机译：听觉正常的听众和人工耳蜗用户对情感言语中提示音的感知
7. Normal-Hearing Listeners’ and Cochlear Implant Users’ Perception of Pitch Cues in Emotional Speech [O] . Steven Gilbers, Christina Fuller, Dicky Gilbers, 2015

机译：正常听力听众和人工耳蜗用户对情绪语音中音高线索的感知

Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅