首页> 外文会议>Conference of the International Speech Communication Association >Robust Estimation of Multiple-Regression HMM Parameters for Dimension-Based Expressive Dialogue Speech Synthesis
【24h】

Robust Estimation of Multiple-Regression HMM Parameters for Dimension-Based Expressive Dialogue Speech Synthesis

机译:基于维度的表达对话语音合成多元回归HMM参数的鲁棒估计

获取原文

摘要

This paper describes spontaneous dialogue speech synthesis based on multiple-regression hidden semi-Markov model (MRHSMM), which enables users to specify paralinguistic information of synthesized speech with a dimensional representation. Paralinguistic aspects of synthesized speech are controlled by multiple regression models whose explanatory variables are abstract dimensions such as pleasant-unpleasant and arousedsleepy. For robust estimation of the regression matrices of the MRHSMM with unbalanced spontaneous dialogue speech samples, the re-estimation formulae were derived in the framework of the maximum a posteriori (MAP) estimation. The result of a perceptual experiment confirmed that the naturalness of synthesized speech was improved by applying the MAP estimation for regression matrices. In addition a high correlation (R ≈ 0.7) wasobserved between given and perceived paralinguistic information, which implies that the proposed method could successfully reflect intended paralinguistic messages on the synthesized speech.
机译:本文介绍了基于多元回归隐藏半标率模型(MRHSMM)的自发对话综合,这使用户能够使用尺寸表示来指定合成语音的单语言信息。合成语音的预级语言方面由多元回归模型控制,其解释变量是抽象尺寸,如令人愉快的令人不愉快和令人难以释放。对于具有不平衡自发性对话语音样本的MRHSMM的回归矩阵的鲁棒估计,在最大后验(MAP)估计的框架中导出重新估计公式。感知实验的结果证实,通过应用回归矩阵的地图估计来改善合成语音的自然度。另外,在给定和感知的预言信息之间的高相关(R≈0.7),意味着所提出的方法可以成功地反映了合成语音的预期比例信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号