首页> 外文会议>International conference on spoken language processing >Using Bayesian Belief Networks for model duration in text-to-speech systems
【24h】

Using Bayesian Belief Networks for model duration in text-to-speech systems

机译:使用贝叶斯信仰网络进行模型持续时间在文本到语音系统中

获取原文

摘要

The problems of database imbalance and factor interaction make modelling of segment duration i ntext-to-speech systems a challenging task. We therefore propos a probabilistic Bayesian belief network (BN) approach to tackle data sparsity and factor interaction problems. The belief network approach makes good estimations in cases of missed or incomplete data. Also, it captures factor interaction in a concise way of causal relationships among the nodes in a directed acyclic (DAG) graph. Furthermore, a belief network approach allows a significant reduction of the number of parameters to be estiamted. In our work, we model segment duration as a hybrid Bayesian network consisting of discrete and continuous nodes; each node in the network represents a linguistic factor that affects segmental duration. The interaction between the factors is represented as conditional dependence relations in the graphical model. We contrasted the results of belief network model with those of sums of products model and classification and regression tree (CART) model. We trained and tested all three models on the same data. Our new model significantly out-performs CART: the belief network achieves a RMS error of 5 milliseconds compared with 20 ms from CART. The SoP model also produces an error of 9 ms, and hence our new model isn't any worse in terms of final performance. However, we think our model has many other advantages compared to SoP, for instance it is much easier to configure and experiemtn with new features. This should make it easier to adapt to new languages.
机译:数据库不平衡和因子交互问题使分段持续时间I NText-to-Mexical Systems建模成为一个具有挑战性的任务。因此,我们可以提出一种概率贝叶斯信念网络(BN)方法来解决数据稀疏性和因子互动问题。信仰网络方法在错过或不完整数据的情况下良好的估算。此外,它以指向的非循环(DAG)图中的节点中的节点中的陈述关系捕获因子交互。此外,信仰网络方法允许显着降低iseSiamt的参数的数量。在我们的工作中,我们将分段持续时间模拟为包括离散和连续节点的混合贝叶斯网络;网络中的每个节点代表了影响分段持续时间的语言因素。因子之间的相互作用表示为图形模型中的条件依赖关系。我们将信仰网络模型与产品模型和分类和回归树(购物车)模型的总和进行了鲜明对比。我们在同一数据上培训并测试了所有三种模型。我们的新模型显着外出了:信念网络与来自购物车20毫秒相比,达到5毫秒的RMS误差。 SOP模型还产生9毫秒的错误,因此我们的新模型在最终表现方面并不差。但是,我们认为我们的模型与SOP相比有许多其他优点,例如,使用新功能配置和体验和体验更容易。这应该使其更容易适应新语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号