Perceptual evaluation of voice source models

机译：语音源模型的感知评估

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Models of the voice source differ in their fits to natural voices, but it is unclear which differences in fit are perceptually salient. This study examined the relationship between the fit of five voice source models to 40 natural voices, and the degree of perceptual match among stimuli synthesized with each of the modeled sources. Listeners completed a visual sort-and-rate task to compare versions of each voice created with the different source models, and the results were analyzed using multidimensional scaling. Neither fits to pulse shapes nor fits to landmark points on the pulses predicted observed differences in quality. Further, the source models fit the opening phase of the glottal pulses better than they fit the closing phase, but at the same time similarity in quality was better predicted by the timing and amplitude of the negative peak of the flow derivative (part of the closing phase) than by the timing and/or amplitude of peak glottal opening. Results indicate that simply knowing how (or how well) a particular source model fits or does not fit a target source pulse in the time domain provides little insight into what aspects of the voice source are important to listeners.

机译：语音源的模型在适合自然声音方面会有所不同，但尚不清楚哪些适合的差异在感知上很明显。这项研究检查了五个声音源模型对40种自然声音的适合度以及与每个模型源合成的刺激之间的知觉匹配程度之间的关系。听众完成了视觉上的排序和排序任务，以比较使用不同源模型创建的每种语音的版本，并使用多维缩放对结果进行分析。既不适合于脉冲形状，也不适合于预测的观察到的质量差异的脉冲上的界标点。此外，源模型比声门脉冲的打开相位更适合于声门脉冲，但同时，通过流量导数的负峰的时间和幅度更好地预测了质量相似性（部分闭合）相位），而不是声门打开峰值的时间和/或幅度。结果表明，仅了解特定源模型在时域中适合（或不适合）目标源脉冲的程度（或不适合），就无法深入了解语音源的哪些方面对听众很重要。

著录项

期刊名称 The Journal of the Acoustical Society of America
作者
Jody Kreiman; Marc Garellek; Gang Chen; Abeer Alwan; Bruce R. Gerratt;
展开▼
作者单位

展开▼
年(卷),期 -1(138),1
年度 -1
页码 1–10
总页数 10
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Perceptual evaluation of voice source models [J] . Kreiman Jody, Garellek Marc, Chen Gang, The Journal of the Acoustical Society of America . 2015,第1期

机译：语音源模型的感知评估
2. Perceptual evaluation of severe pediatric voice disorders: rater reliability using the consensus auditory perceptual evaluation of voice. [J] . Kelchner LN, Brehm SB, Weinrich B, Journal of voice: official journal of the Voice Foundation . 2010,第4期

机译：严重儿科语音障碍的知觉评估：使用语音的共识听觉知觉评估者的信度。
3. Perceptual Evaluation of Dysphonic Voices: Can a Training Protocol Lead to the Development of Perceptual Categories? [J] . Ghio Alain, Dufour Sophie, Wengler Aude, Journal of voice: official journal of the Voice Foundation . 2015,第3期

机译：口头语音的感知评估：培训协议是否可以导致感知类别的发展？
4. A perceptually and physiologically motivated voice source model [C] . Gang Chen, Marc Garellek, Jody Kreiman, Conference of the International Speech Communication Association . 2013

机译：一种感知和生理上动机的语音源模型
5. Design and evaluation of real-time voice-over-IP (VoIP) systems with high perceptual conversational quality. [D] . Sat, Batu. 2010

机译：具有高感知对话质量的实时IP语音（VoIP）系统的设计和评估。
6. Perceptual interaction of the harmonic source and noise in voice [O] . Jody Kreiman, Bruce R. Gerratt -1

机译：谐波源与语音中噪声的感知交互
7. Perceptual Evaluation of Dysphonic Voices: Can a Training Protocol Lead to the Development of Perceptual Categories? [O] . Ghio, Alain, Dufour, Sophie, Wengler, Aude, 2015

机译：口头语音的感知评估：培训协议是否可以导致感知类别的发展？

Perceptual evaluation of voice source models

摘要

著录项

相似文献

相关主题

期刊订阅