
Measuring speech quality for text-to-speech systems: development and assessment of a modified mean opinion score (MOS) scale


Abstract

The quality of text-to-speech systems can be effectively assessed only on the basis of reliable and valid listening tests of overall system performance. A mean opinion scale (MOS) has been the recommended measure of synthesized speech quality [ITU-T Recommendation P.85, 1994. Telephone transmission quality subjective opinion tests. A method for subjective performance assessment of the quality of speech voice output devices]. We assessed this MOS scale and developed and tested a modified measure of speech quality. This modified measure has new items specific to text-to-speech systems. Our research was motivated by the lack of clear evidence on the conceptual content and the psychometric properties of the MOS scale. We present conceptual arguments and empirical evidence for the reliability and validity of a modified scale. Moreover, we employ state-of-the-art psychometric techniques such as confirmatory factor analysis to provide strong tests of psychometric properties. This modified scale is better suited to appraise synthesis systems since it includes items that are specific to the artifacts found in synthesized speech. We believe that the speech synthesis research communities will find this modified scale a better fit for listening tests to assess synthesized speech.
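To make the idea of a multi-item opinion scale concrete, here is a minimal Python sketch of how listener ratings might be summarized. It is not the authors' analysis: the five-point rating range, the function names, and the use of Cronbach's alpha as a reliability statistic are all assumptions chosen for illustration (the abstract itself names confirmatory factor analysis, which is not attempted here).

```python
from statistics import mean, variance


def mean_opinion_score(ratings):
    """Average the ratings one stimulus received on a single scale item.

    Assumes a 1-5 rating scale (an illustrative assumption).
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(not 1 <= r <= 5 for r in ratings):
        raise ValueError("ratings must lie between 1 and 5")
    return mean(ratings)


def cronbach_alpha(rows):
    """Internal-consistency estimate for a multi-item scale.

    rows: one list per listener, holding that listener's rating on each item.
    Cronbach's alpha is a stand-in reliability statistic here, not
    necessarily the one used in the paper.
    """
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least two items and two listeners")
    # Sample variance of each item (column) across listeners.
    item_variances = [variance(column) for column in zip(*rows)]
    # Sample variance of each listener's summed score.
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

A caller would pass `mean_opinion_score` the ratings collected for one synthesized utterance on one item, and pass `cronbach_alpha` a listeners-by-items table to check whether the items move together closely enough to be averaged into one score.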

Bibliographic record

  • Source
    Computer Speech and Language | 2005, Issue 1 | pp. 55-83 | 29 pages
  • Author affiliations

    IBM Thomas J. Watson Research Center, P.O. Box 218, 1101 Kitchawan Road, Route 134, Yorktown Heights, NY 10598, USA;

    Department of Business Administration, University of Illinois at Urbana-Champaign, 61 Wohlers Hall, MC-706, 1206 S. Sixth Street, Champaign, IL 61820, USA;

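  • Indexed in: Science Citation Index (SCI); Engineering Index (EI)
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Computing technology, computer technology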
