Generation method of Lanzhou dialect speech based on Gaussian Mixture Model

机译：基于高斯混合模型的兰州方言语音生成方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dialect generation is one of the most important aspects of Chinese speech synthesis. Using the method of conversion prosodic features we can realize high quality speech synthesis. Firstly, A Lanzhou dialect corpus has been built based on "word-list in dialectal survey" for the generation of Lanzhou dialect. Speech corpus was recorded with contrastive (Lanzhou dialect vs. Mandarin) recordings. A pitch target model is introduced, which is optimized to describe feature parameters of the Mandarin speech and Lanzhou dialect speech in the training set of speech corpus. Secondly, the Gaussian Mixture Model(GMM) can map the subtle prosody distributions between Mandarin and Lanzhou dlialect speech, we train GMM conversion parameter in the training set, and get converted F0 contours of Lanzhou dialect speech by GMM conversion parameter. Using the converted Lanzhou dlialect F0 contours, we can generate high quality Lanzhou dlialect speech by STRAIGHT algorithm. Subjective experiments demonstrated that the generated speech achieve 4.06 of the average mean opinion score(MOS).

机译：方言是中国语音合成最重要的方面之一。使用转换韵律特征的方法，我们可以实现高质量的语音合成。首先，兰州方言语料库已经基于兰州方言代表的“辩证调查词汇清单”。语音语料库被记录在对比（兰州方言与普通话）录音。介绍了一种音高目标模型，该模型被优化，以描述普通话语音的特征参数和兰州语言语音在培训组语料库组中。其次，高斯混合模型（GMM）可以在普通话和兰州DLialect语音之间映射微妙的韵律分布，我们在训练集中培养GMM转换参数，并通过GMM转换参数转换兰州方言语音的F0轮廓。使用转换的兰州DLialect F0轮廓，我们可以通过直线算法产生高质量的兰州DLialect语音。主观实验表明，生成的语音达到了平均意见评分的4.06（MOS）。

著录项

来源
《2011 International Conference on Consumer Electronics, Communications and Networks》|2011年|p.4108-4111|共4页
会议地点
作者
Zhen-ye Gan; Hong-zhi Yu; Hong-wu Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类光电子技术的应用;
关键词
F0 contours; Gaussian Mixture Model; Lanzhou Dialect Speech; Speech Generation;

机译：F0轮廓;高斯混合模型;兰州话;语音产生;

相似文献

外文文献
中文文献
专利

1. A Grasp-pose Generation Method Based on Gaussian Mixture Models [J] . Wu Wenjia International Journal of Advanced Robotic Systems . 2015,第期

机译：一种基于高斯混合模型的掌握姿态生成方法
2. Discriminative training of Gaussian mixture bigram models with application to Chinese dialect identification [J] . Wuei-He Tsai, Wen-Whei Chang Speech Communication . 2002,第3a4期

机译：高斯混合二元模型的判别训练及其在汉语方言识别中的应用
3. Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model [J] . Yamato OHTANI, Masatsune TAMURA, Masahiro MORITA, IEICE transactions on information and systems . 2016,第10期

机译：基于子带基谱模型的高斯混合模型的语音合成统计带宽扩展
4. Generation method of Lanzhou dialect speech based on Gaussian Mixture Model [C] . Zhen-ye Gan, Hong-zhi Yu, Hong-wu Yang International Conference on Consumer Electronics, Communications and Networks . 2011

机译：基于高斯混合模型的兰州方言语言的发电方法
5. Mixtures of inverse covariances: Covariance modeling for Gaussian mixtures with applications to automatic speech recognition. [D] . Vanhoucke, Vincent. 2004

机译：逆协方差的混合：高斯混合的协方差建模及其在自动语音识别中的应用。
6. Novel Methods for Surface EMG Analysis and Exploration Based on Multi-Modal Gaussian Mixture Models [O] . Anna Magdalena Vögele, Rebeka R. Zsoldos, Björn Krüger, -1

机译：基于多模态高斯混合模型的表面肌电分析和探索的新方法
7. A Grasp-Pose Generation Method Based on Gaussian Mixture Models [O] . Wenjia Wu 2015

机译：一种基于高斯混合模型的Grasp-pose生成方法
8. Eigen-Channel Compensation and Discriminatively Trained Gaussian Mixture Models for Dialect and Accent Recognition. [R] . Torres-Carrasquillo, P. A., Sturim, D., Reynolds, D. A., 2016

机译：用于方言和口音识别的特征信道补偿和判别训练的高斯混合模型。

Generation method of Lanzhou dialect speech based on Gaussian Mixture Model

摘要

著录项

相似文献

相关主题

期刊订阅