首页> 外文会议> >New objective distance measures for spectral discontinuities in concatenative speech synthesis
【24h】

New objective distance measures for spectral discontinuities in concatenative speech synthesis

机译:级联语音合成中频谱不连续性的新客观距离度量

获取原文

摘要

The quality of unit selection based concatenative speech synthesis mainly depends on how well two successive units can be joined together to minimise the audible discontinuities. The objective measure of discontinuity used when selecting units is known as the join cost. The ideal join cost measures perceived discontinuity, based on easily measurable spectral properties of the units being joined, in order to ensure smooth and natural-sounding synthetic speech. In this paper we describe a perceptual experiment conducted to measure the correlation between subjective human perception and various objective spectrally-based measures proposed in the literature. Also we report new objective distance measures derived from various distance metrics based on these spectral features, which have good correlation with human perception to concatenation discontinuities. Our experiments used a state-of-the art unit-selection text-to-speech system: rVoice from Rhetorical Systems Limited.
机译:基于单元选择的级联语音合成的质量主要取决于两个连续单元可以结合在一起的程度,以最大程度地减少可听见的不连续性。选择单元时使用的不连续性的客观度量称为连接成本。理想的连接成本基于被连接单元的易于测量的频谱特性来衡量感知到的不连续性,以确保合成语音流畅自然。在本文中,我们描述了一种感知实验,旨在测量主观人类感知与文献中提出的各种基于客观光谱的测量之间的相关性。我们还报告了基于这些频谱特征从各种距离量度得出的新的客观距离量度,这些距离量度与人类对级联不连续性的感知具有良好的相关性。我们的实验使用了最先进的单元选择文本转语音系统:Rhetorical Systems Limited的rVoice。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号