Journal: 電子情報通信学会技術研究報告. 音声. Speech (IEICE Technical Report, Speech)

Synthesizing an emotional voice using prosodic-balanced VCV database

Abstract

We are now developing an emotional-voice synthesizer based on TD-PSOLA. In this paper, we first give examples of the benefits of including non-linguistic information in synthesized voice. Second, we explain the reason for adopting TD-PSOLA with VCV waveform segments, and propose the Prosodic-Balanced Database to compensate for a weakness of the algorithm. Third, we analyze emotional utterances to build the balanced database and to derive formulas that predict phone length. Fourth, we build an emotional-voice synthesizer and synthesize some samples of emotional voice. We carried out a listening test using these samples and found that the emotion recognition rate was 84.1% and the intelligibility of the synthesized speech was 97.9%.
