Pitch models of Mandarin text-to-speech


Abstract

The quality of the prosody model directly affects the naturalness of synthesized speech. To address the difficulty of generating the pitch contour within the prosody model, this paper studies two pitch models in depth: a corpus-based pitch model and a pitch pattern model. The key problems in the corpus-based model are computing the distance and searching for the optimal path with a dynamic programming algorithm. The pitch pattern model describes the pitch contour with parameters such as pitch pattern, pitch average, and pitch range, and six pitch patterns are presented. For pitch contour generation, the pitch pattern model is more flexible than the corpus-based model. Both models were integrated into a real TTS system, and the mean opinion score (MOS) results for synthesized Mandarin speech show that the pitch pattern model performs better than the corpus-based pitch model.
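The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the distance measures or the six pitch patterns, so `target_costs`, `join_cost`, and the `shape` argument are hypothetical stand-ins, and working in Hz rather than a log scale is also an assumption. `search_optimal_path` shows a generic dynamic programming (Viterbi-style) search over corpus candidates, and `render_contour` shows how a normalized pattern shape could be scaled by a pitch average and a pitch range.

```python
from typing import Callable, List, Sequence


def search_optimal_path(
    target_costs: Sequence[Sequence[float]],
    join_cost: Callable[[int, int, int], float],
) -> List[int]:
    """Pick one corpus candidate per syllable so the summed cost is minimal.

    target_costs[t][j] is a hypothetical distance between the wanted context
    of syllable t and candidate j. join_cost(t, i, j) is a hypothetical
    distance for following candidate i of syllable t-1 with candidate j of
    syllable t. Neither is taken from the paper.
    """
    if not target_costs:
        return []
    best = list(target_costs[0])      # cheapest cost of a path ending in each candidate
    back: List[List[int]] = []        # back-pointers, one list per syllable after the first
    for t in range(1, len(target_costs)):
        new_best, pointers = [], []
        for j, tc in enumerate(target_costs[t]):
            # Cheapest predecessor for candidate j of syllable t.
            i_min = min(range(len(best)), key=lambda i: best[i] + join_cost(t, i, j))
            new_best.append(best[i_min] + join_cost(t, i_min, j) + tc)
            pointers.append(i_min)
        best = new_best
        back.append(pointers)
    # Trace back from the cheapest final candidate.
    j = min(range(len(best)), key=best.__getitem__)
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    return path[::-1]


def render_contour(shape: Sequence[float], mean_hz: float, range_hz: float) -> List[float]:
    """Scale a pattern shape so the contour has the given average and range.

    `shape` is a caller-supplied sequence of relative pitch values; it stands
    in for one of the paper's pitch patterns, which the abstract does not list.
    """
    centre = sum(shape) / len(shape)
    span = max(shape) - min(shape)
    if span == 0:
        return [mean_hz] * len(shape)  # flat pattern: constant pitch at the average
    return [mean_hz + range_hz * (s - centre) / span for s in shape]
```

In this sketch the corpus-based route reuses stored contours and only chooses among them, whereas the parametric route can produce a contour for any average and range, which is one way to read the abstract's remark that the pitch pattern model is more flexible.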

Bibliographic record

  • Source
    《哈尔滨工业大学学报(英文版)》 (Journal of Harbin Institute of Technology, English edition) | 2009, No. 2 | pp. 179-184 | 6 pages
  • Author affiliations

    Institute of Computational Linguistics, Peking University, Peking 100871, China;

    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

    Institute of Computational Linguistics, Peking University, Peking 100871, China;

    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

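  • Indexing information
  • Original format: PDF
  • Text language: chi
  • Chinese Library Classification: Pattern recognition and devices;
  • Keywords

  • Date added: 2022-08-19 03:41:10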