Source: 電子情報通信学会技術研究報告. 音声. Speech (IEICE Technical Report, Speech)

Using variable-sized speech segments as targets for concatenative Speech-to-Speech synthesis


Abstract

Concatenative speech synthesis is growing in popularity because of the high naturalness of its voice quality, but it remains domain-specific and has not yet been tested with conversational speech. We propose a method of unit selection to overcome some of the problems that have prevented this development. In particular, we address two problems: the need for an extremely large database of labelled speech, and the incorporation of paralinguistic information into speech synthesis. In our proposed 'speech-to-speech' method, we use acoustic criteria to segment the database into variable-sized units, and then use an acoustic waveform as the target for the unit-selection search. In a final stage, prosodic criteria are applied to select the optimal sequence of units for generating the output waveform. In this paper, we describe the techniques for segmenting the large speech database and the acoustic criteria used for unit selection. We present results comparing two methods of speech database segmentation, together with accuracy results based on phonetic labels and a perceptual test that confirm intelligibility, naturalness, and dictation accuracy.
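The two-stage search outlined in the abstract (acoustic matching against a target, then prosodic selection of the unit sequence) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and does not come from the paper: the `Unit` fields, the Euclidean acoustic distance, the pitch-jump join cost, the shortlist size `k`, and the toy database are all hypothetical, and the database is assumed to be already segmented.

```python
from dataclasses import dataclass
from math import dist
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Unit:
    """Hypothetical database unit: acoustic features plus pitch at both edges."""
    name: str
    features: Tuple[float, ...]  # assumed acoustic feature vector
    f0_start: float              # assumed pitch at the left edge (Hz)
    f0_end: float                # assumed pitch at the right edge (Hz)


def shortlist(target: Sequence[float], database: List[Unit], k: int) -> List[Unit]:
    """Stage 1 (acoustic): keep the k units closest to one target segment."""
    return sorted(database, key=lambda u: dist(u.features, target))[:k]


def select_units(targets, database, k=2):
    """Stage 2 (prosodic): pick one shortlisted unit per target segment so
    that the summed pitch jump across all joins is minimal (dynamic programming)."""
    if not targets:
        return []
    candidates = [shortlist(t, database, k) for t in targets]
    # best[i][j] = (cost of the cheapest path ending in candidates[i][j], back-pointer)
    best = [[(0.0, None)] * len(candidates[0])]
    for i in range(1, len(candidates)):
        row = []
        for unit in candidates[i]:
            costs = [
                best[i - 1][p][0] + abs(prev.f0_end - unit.f0_start)
                for p, prev in enumerate(candidates[i - 1])
            ]
            p_min = min(range(len(costs)), key=costs.__getitem__)
            row.append((costs[p_min], p_min))
        best.append(row)
    # Trace back from the cheapest final state.
    j = min(range(len(best[-1])), key=lambda n: best[-1][n][0])
    path = []
    for i in range(len(candidates) - 1, -1, -1):
        path.append(candidates[i][j])
        j = best[i][j][1]
    return path[::-1]


# Toy database and target, invented purely for illustration.
db = [
    Unit("a", (0.0, 0.0), 100.0, 120.0),
    Unit("b", (0.1, 0.0), 100.0, 200.0),
    Unit("c", (1.0, 1.0), 125.0, 130.0),
    Unit("d", (1.1, 1.0), 210.0, 220.0),
]
chosen = select_units([(0.1, 0.0), (1.0, 1.0)], db, k=2)
print([u.name for u in chosen])  # ['a', 'c']
```

In this toy run, unit `b` is the closest acoustic match for the first target segment, yet the prosodic stage prefers `a` because it joins to `c` with a 5 Hz pitch jump instead of 75 Hz. That is the kind of trade-off a final prosodic stage would be expected to resolve; the actual criteria used in the paper may differ.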
