A corpus-based speech synthesis system with emotion

Akemi Iida; Nick Campbell; Fumito Higuchi; Michiaki Yasumura

首页> 外文期刊>Soil mechanics and foundation engineering >A corpus-based speech synthesis system with emotion

【24h】

A corpus-based speech synthesis system with emotion

机译：一种基于语料库的情感情感合成系统

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new approach to synthesizing emotional speech by a corpus-based concatenative speech synthesis system (ATR CHATR) using speech corpora of emotional speech. In this study, neither emotional-dependent prosody prediction nor signal processing per se is performed for emotional speech. Instead, a large speech corpus is created per emotion to synthesize speech with the appropriate emotion by simple switching between the emotional corpora. This is made possible by the normalization procedure incorporated in CHATR that transforms its standard predicted prosody range according to the source database in use. We evaluate our approach by creating three kinds of emotional speech corpus (anger, joy, and sadness) from recordings of a male and a female speaker of Japanese. The acoustic characteristics of each corpus are different and the emotions identifiable. The acoustic characteristics of each emotional utterance synthesized by our method show clear correlations to those of each corpus. Perceptual experiments using synthesized speech confirmed that our method can synthesize recognizably emotional speech. We further evaluated the method's intelligibility and the overall impression it gives to the listeners. The results show that the proposed method can synthesize speech with a high intelligibility and gives a favorable impression. With these encouraging results, we have developed a workable text-to-speech system with emotion to support the immediate needs of nonspeaking individuals. This paper describes the proposed method, the design and acoustic characteristics of the corpora, and the results of the perceptual evaluations.

机译：我们提出了一种新的方法，通过使用基于情感语料的语料库基于语料的连接语音合成系统（ATR CHATR）来合成情感语料。在这项研究中，对于情感语音，既不执行依赖于情感的韵律预测，也不执行信号处理本身。取而代之的是，为每个情感创建一个大型语音语料库，以通过在情感语料库之间进行简单切换来将语音与适当的情感合成。通过合并在CHATR中的归一化程序使之成为可能，该规程根据使用的源数据库转换其标准预测韵律范围。我们通过从讲日语的男性和女性的录音中创建三种情感语音语料库（愤怒，欢乐和悲伤）来评估我们的方法。每个语料库的声学特性都不同，并且可以识别出各种情感。通过我们的方法合成的每种情绪话语的声学特性都与各个语料库具有明显的相关性。使用合成语音的感知实验证实了我们的方法可以合成可识别的情感语音。我们进一步评估了该方法的清晰度以及它给听众的总体印象。结果表明，所提方法具有较高的清晰度，并且具有良好的印象。有了这些令人鼓舞的结果，我们开发了一种可行的带有语音功能的语音朗读系统，以支持不说话的人的即时需求。本文介绍了提出的方法，语料库的设计和声学特性，以及知觉评估的结果。

著录项

来源
《Soil mechanics and foundation engineering》 |2003年第2期|p.161-187|共27页
作者
Akemi Iida; Nick Campbell; Fumito Higuchi; Michiaki Yasumura;
展开▼
作者单位

Keio Research Institute at SFC, Keio University, 5322, Endo, Fujisawa-city, Kanagawa, 252-8520, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类建筑施工;
关键词

相似文献

外文文献
中文文献
专利

1. A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems [J] . Francisco Campillo Díaz, Eduardo Rodríguez Banga Speech Communication . 2006,第8期

机译：基于语料库的语音合成系统中语调建模与语音单元选择相结合的方法
2. Corpus Design for Malay Corpus-based Speech Synthesis System | Science Publications [J] . Sh-Hussain, Tian-Swee Tan American journal of applied sciences . 2009,第4期

机译：基于马来语语料库的语音合成系统的语料库设计科学出版物
3. A Corpus-Based Concatenative Speech Synthesis System for Turkish [J] . HA??M SAK, TUNGA GüNG?R, YA?AR SAFKAN Turkish Journal of Electrical Engineering and Computer Sciences . 2006,第2期

机译：基于语料库的土耳其语级联语音合成系统
4. Emotion Controllable Speech Synthesis Using Emotion-Unlabeled Dataset with the Assistance of Cross-Domain Speech Emotion Recognition [C] . Xiong Cai, Dongyang Dai, Zhiyong Wu, IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：在跨域语音情感认可的帮助下，使用情感 - 未标记的数据集进行情感可控语音合成
5. Corpus-based analysis of body-part terms for emotions and feelings in English and Japanese. [D] . Tsurumi, Keiko. 2015

机译：基于语料库的英语和日语情感和感觉身体部位术语分析。
6. Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features [O] . Tursunov Anvarjon, Mustaqeem, Soonil Kwon 2020

机译：深网络：使用深频特征的基于轻量级CNN的语音情感识别系统
7. Corpus design for Malay corpus-based speech synthesis system [O] . Tan, Tian-Swee, Sh-Hussain, Sh-Hussain 2009

机译：马来语料库语音合成系统的语料库设计

A corpus-based speech synthesis system with emotion

摘要

著录项

相似文献

相关主题

期刊订阅