Duration Refinement for Hybrid Speech Synthesis System using Random Forest

Abstract

Hybrid speech synthesis systems, which combine hidden Markov models (HMMs) with unit selection, have recently been widely used and studied in both industry and academia because of their naturalness and expressiveness. However, the target duration, which controls the duration of the selected candidate units, is still predicted by a state-based duration model whose performance is far from satisfactory. As a result, the synthetic speech sounds somewhat bland and even tedious. In this paper, we replace the state-based duration model with a Random Forest (RF). Experiments on an English database show that the new model yields more accurate predictions than the baseline state-based duration model: the average improvement in phone-level RMSE is 4.265 ms in absolute terms and 14.6% in relative terms. Perceptual experiments on the same database further confirm that the proposed model performs better than the baseline model.
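The idea in the abstract, predicting phone durations with a random forest and comparing RMSE against a baseline, can be sketched roughly as follows. This is a minimal illustration under stated assumptions and not the authors' implementation: the three binary features, the toy durations, the hyperparameters, and all function names are hypothetical, and a global-mean predictor stands in for the state-based HMM duration model, which the abstract does not describe in enough detail to reproduce.

```python
import math
import random


def rmse(predicted, actual):
    """Root-mean-square error, in the same unit as the inputs (here: ms)."""
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual))


def mean(values):
    return sum(values) / len(values)


def build_tree(rows, targets, rng, n_try, depth=0, max_depth=4, min_leaf=2):
    """Grow one regression tree; each split considers n_try randomly chosen features."""
    if depth >= max_depth or len(rows) < 2 * min_leaf:
        return mean(targets)  # leaf: predict the mean duration of its samples
    best = None  # (sum of squared errors, feature, threshold, left idx, right idx)
    for f in rng.sample(range(len(rows[0])), n_try):
        for threshold in sorted({r[f] for r in rows}):
            left = [i for i, r in enumerate(rows) if r[f] <= threshold]
            right = [i for i, r in enumerate(rows) if r[f] > threshold]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            sse = 0.0
            for side in (left, right):
                m = mean([targets[i] for i in side])
                sse += sum((targets[i] - m) ** 2 for i in side)
            if best is None or sse < best[0]:
                best = (sse, f, threshold, left, right)
    if best is None:
        return mean(targets)  # no valid split among the sampled features
    _, f, threshold, left, right = best

    def grow(idx):
        return build_tree([rows[i] for i in idx], [targets[i] for i in idx],
                          rng, n_try, depth + 1, max_depth, min_leaf)

    return (f, threshold, grow(left), grow(right))


def predict_tree(node, row):
    while isinstance(node, tuple):  # internal nodes are tuples, leaves are floats
        f, threshold, left, right = node
        node = left if row[f] <= threshold else right
    return node


def fit_forest(rows, targets, n_trees=25, n_try=2, seed=0):
    """Bagging: every tree is trained on a bootstrap resample of the data."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]
        forest.append(build_tree([rows[i] for i in idx], [targets[i] for i in idx], rng, n_try))
    return forest


def predict_forest(forest, row):
    return mean([predict_tree(tree, row) for tree in forest])


# Toy data, entirely made up: [is_vowel, is_stressed, is_phrase_final] -> duration in ms.
data_rng = random.Random(1)
rows, durations = [], []
for _ in range(200):
    v, s, f = (data_rng.randint(0, 1) for _ in range(3))
    rows.append([v, s, f])
    durations.append(60 + 40 * v + 20 * s + 30 * f + data_rng.gauss(0, 5))

train_x, test_x = rows[:150], rows[150:]
train_y, test_y = durations[:150], durations[150:]

forest = fit_forest(train_x, train_y)
forest_rmse = rmse([predict_forest(forest, r) for r in test_x], test_y)

# Stand-in baseline (NOT the paper's state-based model): always predict the training mean.
baseline_rmse = rmse([mean(train_y)] * len(test_y), test_y)

improvement_ms = baseline_rmse - forest_rmse
improvement_pct = 100.0 * improvement_ms / baseline_rmse
print(f"baseline {baseline_rmse:.2f} ms, forest {forest_rmse:.2f} ms, "
      f"improvement {improvement_ms:.2f} ms ({improvement_pct:.1f}%)")
```

The last two quantities mirror how the abstract reports its result: an absolute RMSE reduction in milliseconds and the same reduction as a percentage of the baseline RMSE. In practice a library implementation such as scikit-learn's `RandomForestRegressor` would normally replace the hand-written forest.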

Bibliographic details

Source: International Conference on Affective Computing and Intelligent Interaction, 2015, 5 pages
Authors: Ran Zhang; Xiaoyan Lou; Qinghua Wu
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: Hybrid speech synthesis; Duration model; Random forest; CART
Indexed: 2022-08-21 07:16:08
