首页> 外文会议>Annual Conference of the International Speech Communication Association >Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach
【24h】

Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach

机译:利用增强的嵌入特征和模型融合方法改善普通话语音合成的韵律边界预测

获取原文

摘要

Hierarchical prosody structure generation is an important but challenging component for speech synthesis systems. In this paper, we investigate the use of enhanced embedding (joint learning of character and word embedding (CWE)) features and different model fusion approaches at both character and word level for Mandarin prosodic boundaries prediction. For CWE module, the internal structures of words and non-compositional words are considered in the word embedding, while the character ambiguity is addressed by multiple-prototype character embedding. For model fusion module, linear function (LF) and gradient boosting decision tree (GBDT), are investigated at the decision level respectively, with the important features selected by feature ranking module used as its input. Experiment results show the effectiveness of the proposed enhanced embedding features and the two model fusion approaches at both character and word level.
机译:等级韵律结构生成是语音合成系统的重要而挑战性的组件。在本文中,我们调查了增强嵌入的使用(共同学习字符和词嵌入(CWE))特征和不同的模型融合方法,以及普通话韵律边界预测的字符和单词级别。对于CWE模块,在嵌入单词中考虑单词和非组成单词的内部结构,而字符模糊是由多原型字符嵌入解决的。对于模型融合模块,线性函数(LF)和渐变升压决策树(GBDT)分别在决策级别进行调查,具有通过用作其输入的特征排序模块选择的重要特征。实验结果表明,建议的增强嵌入功能和两个模型融合方法的有效性和字符和字级别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号