Conference: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing

Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis

Abstract

This paper proposes a technique for improving tone correctness in Thai speech synthesis based on an average voice model trained on a nonprofessional speech corpus. The proposed technique uses quantized F0 symbols as the tonal context in order to obtain an appropriate F0 model. With this technique, the prosodic context can be extracted directly from real speech, which prevents inconsistency between the speech data and F0 labels generated from transcription; such inconsistency degrades the naturalness and tone correctness of synthetic speech. We examine two types of tonal context labeling that use quantized F0 symbols based on phone and sub-phone boundaries. Results of both objective and subjective tests show that the proposed technique improves not only the naturalness but also the tone correctness of synthetic speech when only a small amount of speech data from nonprofessional target speakers is used.
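The abstract does not state how F0 is quantized, so the following is only a minimal sketch of the general idea of labeling phone or sub-phone segments with quantized F0 symbols. It assumes uniform quantization of each segment's mean log-F0 over the utterance's range, segments given as frame-index pairs, and unvoiced frames marked with 0. The function names, the number of levels, and the equal-length sub-phone split are all hypothetical choices, not the authors' method.

```python
import math


def quantize_f0(f0_track, segments, n_levels=5):
    """Assign each segment a symbol in 0..n_levels-1 from its mean log-F0.

    f0_track: per-frame F0 values in Hz, with 0 for unvoiced frames (assumption).
    segments: list of (start_frame, end_frame) pairs, end exclusive (assumption).
    Returns None for segments that contain no voiced frames.
    """
    means = []
    for start, end in segments:
        voiced = [math.log(f) for f in f0_track[start:end] if f > 0]
        means.append(sum(voiced) / len(voiced) if voiced else None)

    valid = [m for m in means if m is not None]
    if not valid:
        return [None] * len(segments)

    lowest, highest = min(valid), max(valid)
    span = highest - lowest
    symbols = []
    for m in means:
        if m is None:
            symbols.append(None)
        elif span == 0:
            # All voiced segments share one pitch: use the lowest symbol.
            symbols.append(0)
        else:
            level = int((m - lowest) / span * n_levels)
            symbols.append(min(level, n_levels - 1))
    return symbols


def split_sub_phones(segments, parts=2):
    """Split each phone segment into `parts` roughly equal sub-phone segments."""
    out = []
    for start, end in segments:
        bounds = [start + (end - start) * i // parts for i in range(parts + 1)]
        out.extend(zip(bounds[:-1], bounds[1:]))
    return out
```

Calling `quantize_f0(f0_track, phone_segments)` gives one symbol per phone, while `quantize_f0(f0_track, split_sub_phones(phone_segments))` gives the finer sub-phone labeling. In both cases the symbols come from the recorded speech itself and not from the transcription.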