首页> 外文会议>International Conference on Speech and Computer >Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space
【24h】

Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space

机译:差分地图在乐趣唤醒空间中的综合语音评估

获取原文

摘要

The paper deals with automatic evaluation of the quality of synthetic speech using Gaussian mixture models (GMM) for classification in the Pleasure-Arousal (P-A) scale and subsequently calculated 2D and 3D P-A differentials maps. The speech synthesized from the voice of a speaker is compared with the original voice of the same speaker. Three methods of speech synthesis are ordered by descending 3D perceptual distances from the original speech material. Basic experiments confirm the principal functionality of the developed system. The detailed analysis shows a great influence of the number of mixture components, the size of the processed speech material, and the type of the database for GMM creation on partial results of the continual P-A detection and the final results. The objective evaluation results are finally compared with the subjective ratings by human evaluators.
机译:本文涉及使用高斯混合模型(GMM)进行综合演讲质量的自动评估,以便在令人愉悦的令人乐趣(P-A)尺度和随后计算的2D和3D P-A差分地图中的分类。与扬声器的声音合成的语音与同一扬声器的原始声音进行比较。通过从原始语音材料下降3D感知距离来命令三种语音合成方法。基本实验证实了开发系统的主要功能。详细分析显示了混合组件的数量,加工语音材料的大小和数据库类型对GMM创建的数据库的类型的影响很大,而是在持续的P-A检测和最终结果的部分结果上。目标评估结果最终与人类评估人员的主观评级进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号