首页> 外文OA文献 >Bayesian modelling of vowel segment duration for text-to-speech synthesis using distinctive features
【2h】

Bayesian modelling of vowel segment duration for text-to-speech synthesis using distinctive features

机译:贝叶斯元音段持续时间的建模,使用独特功能进行文本到语音合成

摘要

We report the results of applying the Bayesian Belief Network (BN) approach to predicting vowel duration. A Bayesian inference of the vowel duration is performed on a hybrid Bayesian network consisting of discrete and continuous nodes, with the nodes in the network representing the linguistic factors that affect segment duration. New to the present research, we model segment identity factor as a set of distinctive features. The features chosen were height, frontness, length, and roundness. We also experimented with a word class feature that implicitly represents word frequency information. We contrasted the results of the belief network model with those of the sums of products (SoP) model and classification and regression tree (CART) model. We trained and tested all three models on the same data. In terms of the RMS error and correlation coefficient, our BN model performs no worse than SoP model, and it significantly outperforms CART model.
机译:我们报告应用贝叶斯信仰网络(BN)方法预测元音持续时间的结果。在由离散节点和连续节点组成的混合贝叶斯网络上执行元音持续时间的贝叶斯推断,网络中的节点代表影响片段持续时间的语言因素。本研究的新手,我们将段标识因子建模为一组独特的功能。选择的特征是高度,正面,长度和圆度。我们还试验了隐含表示词频信息的词类功能。我们将置信网络模型的结果与乘积和(SoP)模型以及分类和回归树(CART)模型的结果进行了对比。我们在相同的数据上训练并测试了所有三个模型。就RMS误差和相关系数而言,我们的BN模型的性能不比SoP模型差,并且其性能明显优于CART模型。

著录项

  • 作者

    Goubanova Olga V;

  • 作者单位
  • 年度 2003
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号