
Perceptual evaluation of voice source models

Abstract

Models of the voice source differ in their fits to natural voices, but it is unclear which differences in fit are perceptually salient. This study examined the relationship between the fit of five voice source models to 40 natural voices, and the degree of perceptual match among stimuli synthesized with each of the modeled sources. Listeners completed a visual sort-and-rate task to compare versions of each voice created with the different source models, and the results were analyzed using multidimensional scaling. Neither fits to pulse shapes nor fits to landmark points on the pulses predicted observed differences in quality. Further, the source models fit the opening phase of the glottal pulses better than they fit the closing phase, but at the same time similarity in quality was better predicted by the timing and amplitude of the negative peak of the flow derivative (part of the closing phase) than by the timing and/or amplitude of peak glottal opening. Results indicate that simply knowing how (or how well) a particular source model fits or does not fit a target source pulse in the time domain provides little insight into what aspects of the voice source are important to listeners.
