首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units
【24h】

A Concatenative Speech Synthesis Method Using Context Dependent Phoneme Sequences with Variable Length as Search Units

机译:一种基于上下文的变长音素序列作为搜索单元的语音合成方法

获取原文
获取原文并翻译 | 示例
           

摘要

A concatenative speech synthesis method using words and clustered triphones as search units has been proposed. Only 20% words in the sentences, however, satisfy the phoneme context as well as enough quantity in the speech database although most of the words in the sentences can be found in the speech database. So in this paper, we try to use context dependent phoneme sequences not bound by words and to concatenate the speech segments satisfying phoneme context. We produce synthesized speech by proposed method using four different size speech databases. The subjective evaluation on then-naturalness shows that 1) mean opinion score is 3.6, 2) enlargement of speech database more than 43.2 hours doesn't contribute the improvement in naturalness 3) the processing time of speech synthesis shows a little increase with enlargement of speech database.
机译:提出了一种以词和簇三音为搜索单元的级联语音合成方法。尽管句子中的大多数单词都可以在语音数据库中找到,但是句子中只有20%的单词可以满足音素上下文以及语音数据库中足够的数量。因此,在本文中,我们尝试使用不受单词限制的上下文相关音素序列,并连接满足音素上下文的语音段。我们通过使用四个不同大小的语音数据库的建议方法来生成合成语音。对自然的主观评价表明:1)平均意见得分为3.6,2)语音数据库的扩大超过43.2小时没有对自然性的改善做出贡献3)语音合成的处理时间随的扩大而略有增加语音数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号