Synthesizing an emotional voice using prosodic-balanced VCV database

Masaki Kasamatsu; Takuya Nishimoto; Masahiro Araki; Yasuhisa Niimi

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Synthesizing an emotional voice using prosodic-balanced VCV database

【24h】

Synthesizing an emotional voice using prosodic-balanced VCV database

机译：使用韵律平衡的VCV数据库合成情感声音

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We are now developing an emotional-voice synthesizer based on TD-PSOLA. In this paper, we first give examples of the merit of containing non-linguistic information in synthesized voice. Second, we explain the reason of adopting TD-PSOLA using VCV-wave segments, and suggest the Prosodic-Balanced Database to compensate a fault of the algorithm. Third, we analyze emotional utterances to make balanced database and deriving fomulas which predict phone length. Fourth, we build an emotional-voice synthesizer and synthesize some samples emotional voice. We carried out a hearing test using these samples, and obtained the result that the rate of emotion recognition was 84.1% and intelligibility of synthesized speech was 97.9%.

机译：我们现在正在开发基于TD-PSOLA的语音合成器。在本文中，我们首先给出在合成语音中包含非语言信息的优点的示例。其次，我们解释了采用VCV波段采用TD-PSOLA的原因，并提出了Prosodic-Balanced数据库来弥补该算法的错误。第三，我们分析情绪话语，以建立平衡的数据库并推导预测电话长度的信息群。第四，我们构建了一个情绪语音合成器，并合成了一些样本情绪语音。我们使用这些样本进行了听力测试，得出的结果是，情绪识别率为84.1％，合成语音的清晰度为97.9％。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2000年第726期|共6页
作者
Masaki Kasamatsu; Takuya Nishimoto; Masahiro Araki; Yasuhisa Niimi;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类电报、传真;
关键词
Emotional voice; Speech synthesis; TD-PSOLA; Prosodic-balanced database;

机译：情感语音语音合成TD-PSOLA韵律平衡数据库;

相似文献

外文文献
中文文献
专利

1. Synthesizing an emotional voice using prosodic-balanced VCV database [J] . Masaki Kasamatsu, Takuya Nishimoto, Masahiro Araki, 電子情報通信学会技術研究報告. 音声. Speech . 2000,第726期

机译：使用韵律平衡的VCV数据库合成情感声音
2. Multimodal processing of emotional information in 9-month-old infants I: Emotional faces and voices [J] . Otte R. A., Donkers F. C. L., Braeken M. A. K. A., Brain and cognition . 2015,第apra期

机译：9个月大婴儿情绪信息的多模态处理I：情绪面孔和声音
3. A Markov Chain Analysis of Emotional Exchange in Voice-to-Voice Communication: Testing for the Mimicry Hypothesis of Emotional Contagion [J] . Rueff-Lopes Rita, Navarro Jose, Caetano Antonio, Human communication research . 2015,第3期

机译：语音对语音交流中的情感交流的马尔可夫链分析：情感传染的模仿假设检验
4. Adding emotional factors to synthesized voices [C] . Hara F. Robot and Human Communication, 1997. RO-MAN '97. Proceedings., 6th IEEE International Workshop on . 1997

机译：在合成声音中添加情感因素
5. Pairing media-captured human versus computer-synthesized humanoid faces and voices for talking heads: A consistency theory for interface agents. [D] . Gong, Li. 2001

机译：将媒体捕获的人与计算机合成的人形面部和声音配对以用于说话人：接口代理的一致性理论。
6. Seeing a Face in a Crowd of Emotional Voices: Changes in Perception and Cortisol in Response to Emotional Information across the Senses [O] . Sarah C. Izen, Hannah E. Lapp, Daniel A. Harris, 2019

机译：在情绪化的声音人群中看到一张脸：跨感官的情绪信息对感知和皮质醇的改变
7. Constructing physically realistic VCV stimuli for the perception of stop voicing in European Portuguese [O] . Daniel Pape, Luis M. T. Jesus, Pascal Perrier 2012

机译：为欧洲葡萄牙语中的停止发声构建物理逼真的VCV刺激

Synthesizing an emotional voice using prosodic-balanced VCV database

摘要

著录项

相似文献

相关主题

期刊订阅