首页> 外文会议>Annual Conference of the International Speech Communication Association >A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks
【24h】

A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks

机译:使用神经网络的合成语音自然的分层预测因子

获取原文

摘要

A problem when developing and tuning speech synthesis systems is that there is no well-established method of automatically rating the quality of the synthetic speech. This research attempts to obtain a new automated measure which is trained on the result of large-scale subjective evaluations employing many human listeners, i.e., the Blizzard Challenge. To exploit the data, we experiment with linear regression, feed-forward and convolutional neural network models, and combinations of them to regress from synthetic speech to the perceptual scores obtained from listeners. The biggest improvements were seen when combining stimulus- and system-level predictions.
机译:开发和调整语音合成系统的问题是,没有自动评估合成语音的质量的既定方法。 该研究试图获得新的自动化措施,这些措施培训,这些措施受到许多人类听众的大规模主观评估的结果,即暴风雪挑战。 为了利用数据,我们尝试线性回归,前馈和卷积神经网络模型,以及它们与从侦听器获得的感知分数的回归的组合。 结合刺激和系统级预测时,可以看到最大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号