首页> 外文会议>International Summer School 'Neural Nets E.R.Caianiello' on Nonlinear Speech Processing: Algorithms and Analysis >Nonlinear Speech Features for the Objective Detection of Discontinuities in Concatenative Speech Synthesis
【24h】

Nonlinear Speech Features for the Objective Detection of Discontinuities in Concatenative Speech Synthesis

机译:无限性地检测连续语音合成中断的非线性语音特征

获取原文
获取外文期刊封面目录资料

摘要

An objective distance measure which is able to predict audible discontinuities in concatenative speech synthesis systems is very important. Previous results showed that linear approaches are not very effective to detect audible discontinuities. The best result was obtained by using the Kullback-Leibler distance on power spectra with the rate of 37%. In this paper, we present two nonlinear approaches for the detection of discontinuities. The first method is based on a nonlinear harmonic model for speech while the second method is based on the demodulation of speech in an amplitude and a frequency component using the Teager energy operator. Results show that detection rate can exceed 70%, which is an improvement of about 95% over previous published results.
机译:能够预测连接性语音合成系统中可听的不连续性的客观距离测量非常重要。以前的结果表明,线性方法不太有效地检测听见的不连续性。通过在功率谱上使用Qullback-Leibler距离获得的最佳结果,其速率为37%。在本文中,我们提出了两种用于检测不连续性的非线性方法。第一方法基于非线性谐波模型用于语音,而第二种方法基于使用Texer能量操作员的幅度和频率分量中的语音解调。结果表明,检出率可能超过70%,这是在以前公布的结果上的提高约95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号