首页> 外文会议>INTERSPEECH 2012 >mproved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis
【24h】

mproved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis

机译:改进了生成过程模型命令的自动提取及其用于生成基于培养的基于HMM的语音合成的基本频率轮廓的用途

获取原文
获取外文期刊封面目录资料

摘要

Generation process model of fundamental frequency (F_0) contours can well represent F_0 movements of speech keeping a clear relation with linguistic information of utterances. Therefore, by using the model, improvement of HMM-based speech synthesis is expected. One of major problems preventing the use of the model is that the performance of automatic extraction of the model parameters from observed F_0 contours is still rather limited. A new method of automatic extraction was developed. Its algorithm is inspired from how humans do, and extracts phrase components first, while conventional methods extract accent components first. Also the method uses linguistic information of texts, which is the same as that used in HMM-based speech synthesis. A significant improvement of extraction is realized. Using the method, the model parameters are extracted for the speech corpus of HMM training, and F_0 contours generated by the model are used for the HMM training instead of the original F_0 contours. Listening experiment of synthetic speech indicates improvements in speech quality.
机译:基频(F_0)轮廓的生成过程模型可以很好地代表语音的动作,保持与语言信息的语言信息明确。因此,通过使用该模型,预期基于HMM的语音合成的改进。防止使用该模型的主要问题之一是,从观察到的F_0轮廓的自动提取模型参数的性能仍然相当有限。开发了一种新的自动提取方法。它的算法激发了人类的方式,并首先提取短语组分,而传统方法首先提取重音组件。此方法还使用文本的语言信息,这与基于HMM的语音合成中使用的语言相同。实现了提取的显着改善。使用该方法,提取模型参数用于HMM训练的语音语料库,并且模型生成的F_0轮廓用于HMM训练而不是原始F_0轮廓。合成语音的听力实验表明语音质量的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号