首页> 外文会议>International Conference on Speech and Computer >Improvements to Prosodic Variation in Long Short-Term Memory Based Intonation Models Using Random Forest
【24h】

Improvements to Prosodic Variation in Long Short-Term Memory Based Intonation Models Using Random Forest

机译:使用随机林的长短期内存内部内部模型的韵律变化改进

获取原文

摘要

Statistical parametric speech synthesis has overcome unit selection methods in many aspects, including flexibility and variability. However, the intonation of these systems is quite monotonic, especially in case of longer sentences. Due to statistical methods the variation of fundamental frequency (F0) trajectories decreases. In this research a random forest (RF) based classifier was trained with radio conversations based on the perceived variation by a human annotator. This classifier was used to extend the labels of a phonetically balanced, studio quality speech corpus. With the extended labels a Long Short-Term Memory (LSTM) network was trained to model fundamental frequency (F0). Objective and subjective evaluations were carried out. The results show that the variation of the generated F0 trajectories can be fine-tuned with an additional input of the LSTM network.
机译:统计参数语音合成在许多方面克服了单位选择方法,包括灵活性和可变性。然而,这些系统的语调是相当单调的,特别是在更长的句子的情况下。由于统计方法,基本频率(F0)轨迹的变化降低。在该研究中,基于随机森林(RF)的分类器,基于人类注释器的感知变化,通过无线电对话训练。该分类器用于扩展语音平衡,工作室质量语音语料库的标签。使用扩展标签,长期短期内存(LSTM)网络培训以模拟基波频率(F0)。目的和主观评估进行。结果表明,使用LSTM网络的额外输入可以进行微调的F0轨迹的变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号