...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Candidate Expansion and Prosody Adjustment for Natural Speech Synthesis Using a Small Corpus
【24h】

Candidate Expansion and Prosody Adjustment for Natural Speech Synthesis Using a Small Corpus

机译:使用小语料库进行自然语音合成的候选音调和韵律调整

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This study proposes a hybrid approach to natural-sounding speech synthesis based on candidate expansion, unit selection, and prosody adjustment using a small corpus. The proposed method is more specific to tonal language, in particular Mandarin. In conventional speech synthesis studies, the quality of synthesized speech depends heavily on the size of the speech corpus. However, it is highly time-consuming and labor-intensive to prepare a large labeled corpus. In this work, candidate expansion is proposed to retrieve potential candidates that are unlikely to be retrieved using only linguistic features. The optimal unit sequence is then obtained from the expanded candidates by using the proposed unit selection mechanism at the phoneme and prosodic word levels. Finally, a prosodic word-level prosody adjustment is proposed to improve the continuity and smoothness of the prosody of the synthesized speech. To evaluate the proposed method, the Tsing-Hua corpus of speech synthesis was adopted. The results of an objective evaluation demonstrate the effectiveness of candidate expansion and the improvement of the continuity and smoothness of the prosody of the synthesized speech. The results of a subjective evaluation also show the proposed system could synthesize the speech with improved quality and naturalness, in particular for a small-sized or resource-limited corpus.
机译:这项研究提出了一种基于候选扩展,​​单位选择和使用小语料库的韵律调整的自然声音语音合成的混合方法。所提出的方法更特定于音调语言,尤其是普通话。在常规语音合成研究中,合成语音的质量在很大程度上取决于语音语料库的大小。但是,准备一个大型的标记语料库非常耗时且费力。在这项工作中,提出了候选扩展以检索不太可能仅使用语言特征检索的潜在候选。然后,通过在音素和韵律词级别上使用建议的单位选择机制,从扩展的候选项中获得最佳单位序列。最后,提出了韵律水平的韵律调整,以提高合成语音韵律的连续性和流畅性。为了评估所提出的方法,采用了清华语音合成语料库。客观评估的结果证明了候选扩展的有效性以及合成语音韵律的连续性和平滑性的提高。主观评价的结果还表明,所提出的系统能够以提高的质量和自然度来合成语音,特别是对于小型或资源有限的语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号