首页> 外文会议>European conference on speech communication and technology >An Objective Measure for Estimating MOS of Synthesized Speech
【24h】

An Objective Measure for Estimating MOS of Synthesized Speech

机译:综合演讲估计MOS的客观措施

获取原文

摘要

This paper proposes an average concatenative cost function as the objective measure for naturalness of synthesized speech. All its seven component-costs can be derived directly from the input text and the scripts of speech database. A formal Mean Opinion Score (MOS) experiment shows mat the average concatenative cost and its seven components are all highly correlated with MOS obtained subjectively. The correlation coefficient between the objective measure and subjective measure is -0.872. The mean of errors in MOS estimation for individual waveforms is 0.32 with 0.40 RMSE. When estimating the overall MOS for ITS systems, the mean error is smaller than 0.05. With the proposed objective measure, it becomes possible and easy for us to track the performance in naturalness regularly. The proposed cost function could also serve as criteria for optimizing the algorithms for unit selecting and speech database pruning.
机译:本文提出了平均的连续成本函数作为综合演讲自然的客观措施。它的所有七个组件 - 成本都可以直接从输入文本和语音数据库的脚本中派生。正式的平均意见评分(MOS)实验表明,垫子平均连接成本及其七种组分与主体获得的MOS高度相关。客观度量和主观措施之间的相关系数为-0.872。各个波形的MOS估计误差的平均值为0.32,0.40 RMSE。估计其系统的整体MOS时,平均误差小于0.05。通过拟议的客观措施,我们可以易于追踪自然的性能。所提出的成本函数也可以作为优化单位选择和语音数据库修剪的算法的标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号