2011 International Conference on Consumer Electronics, Communications and Networks
Generation method of Lanzhou dialect speech based on Gaussian Mixture Model

Abstract

Dialect generation is one of the most important aspects of Chinese speech synthesis. By converting prosodic features, high-quality dialect speech can be synthesized from Mandarin speech. First, a Lanzhou dialect corpus was built from the "word-list in dialectal survey" for generating Lanzhou dialect speech. The speech corpus consists of contrastive (Lanzhou dialect vs. Mandarin) recordings. A pitch target model is introduced and optimized to describe the feature parameters of the Mandarin and Lanzhou dialect speech in the training set of the corpus. Second, because a Gaussian Mixture Model (GMM) can map the subtle prosody distributions between Mandarin and Lanzhou dialect speech, GMM conversion parameters are trained on the training set and then used to obtain converted F0 contours for Lanzhou dialect speech. From the converted Lanzhou dialect F0 contours, high-quality Lanzhou dialect speech is generated with the STRAIGHT algorithm. Subjective experiments showed that the generated speech achieves an average mean opinion score (MOS) of 4.06.
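
The abstract does not give the form of the GMM conversion function, so the following is only a minimal sketch of one common formulation, joint-density GMM regression, applied to scalar log-F0 values. It is not the authors' implementation: the paper converts pitch target model parameters, while this sketch converts raw F0 values for brevity. The names convert_f0 and HYPOTHETICAL_GMM, and every numeric parameter, are illustrative assumptions; in practice the parameters would be estimated (for example with EM) from aligned Mandarin and Lanzhou dialect training pairs.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def convert_f0(x, components):
    """Map a source log-F0 value x to a target log-F0 value.

    Each component describes a joint Gaussian over (source, target):
    weight, mu_x, mu_y, var_x (source variance), cov_xy (cross-covariance).
    The result is the posterior-weighted sum of per-component linear
    regressions, i.e. the conditional mean E[y | x] under the joint GMM.
    """
    # Unnormalized responsibility of each component for the source value.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    posteriors = [lik / total for lik in likelihoods]

    # Per-component conditional mean of the target given the source.
    return sum(
        p * (c["mu_y"] + c["cov_xy"] / c["var_x"] * (x - c["mu_x"]))
        for p, c in zip(posteriors, components)
    )


# Hypothetical, hand-picked parameters (NOT taken from the paper).
# Component 1 models a low pitch region, component 2 a high pitch region.
HYPOTHETICAL_GMM = [
    {"weight": 0.5, "mu_x": math.log(120.0), "mu_y": math.log(110.0),
     "var_x": 0.01, "cov_xy": 0.008},
    {"weight": 0.5, "mu_x": math.log(220.0), "mu_y": math.log(240.0),
     "var_x": 0.01, "cov_xy": 0.008},
]

# Illustrative Mandarin F0 values in Hz (made up for the example).
mandarin_f0_hz = [118.0, 125.0, 210.0, 230.0]

# Convert in the log domain, then return to Hz.
converted_f0_hz = [
    math.exp(convert_f0(math.log(f0), HYPOTHETICAL_GMM)) for f0 in mandarin_f0_hz
]
```

Because each source value is softly assigned to the mixture components, the mapping is piecewise-smooth rather than a single global linear transform, which is the property that lets a GMM capture region-specific prosodic differences. A full system would then pass the converted contour to a vocoder such as STRAIGHT for resynthesis, a step not shown here.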
