On the physiological validity of the group delay response of all-pole vocal tract modeling

Abstract

Magnitude-oriented approaches dominate the voice analysis front-ends of most current technologies addressing e.g. speaker identification, speech coding/compression, voice reconstruction and re-synthesis. A popular technique is all-pole vocal tract modeling. The phase response of all-pole models is known to be non-linear and highly dependent on the magnitude frequency response. In this paper, we use a shift-invariant phase-related feature that is estimated from signal harmonics in order to study the impact of all-pole models on the phase structure of voiced sounds. We relate that impact to the phase structure that is found in natural voiced sounds to conclude on the physiological validity of the group delay of all-pole vocal tract modeling. Our findings emphasize that harmonic phase models are idiosyncratic, and this is important in speaker identification, and in fostering the quality and naturalness of synthetic and reconstructed speech.
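The abstract's remark that the phase response of an all-pole model is non-linear and tied to its magnitude response can be illustrated with a minimal sketch. It evaluates the magnitude and the group delay of H(z) = 1/A(z) directly from the denominator coefficients, using the identity that the group delay equals -Re{B/A}, where A and B are the sums of a_k e^{-jωk} and k a_k e^{-jωk}. The function names `allpole_response` and `resonator` and the example pole radius and angle are assumptions chosen for illustration; they do not come from the paper, and the sketch does not implement the paper's shift-invariant harmonic phase feature.

```python
import cmath
import math


def allpole_response(a, omega):
    """Magnitude and group delay (in samples) of H(z) = 1 / A(z) at omega.

    a = [1, a1, ..., ap] holds the coefficients of A(z) = sum_k a_k z^-k.
    omega is the normalized angular frequency in radians per sample.
    """
    # A(e^{jw}) and its "ramped" counterpart B(e^{jw}) = sum_k k * a_k * e^{-jwk}
    A = sum(ak * cmath.exp(-1j * omega * k) for k, ak in enumerate(a))
    B = sum(k * ak * cmath.exp(-1j * omega * k) for k, ak in enumerate(a))
    magnitude = 1.0 / abs(A)
    # The group delay of A alone is Re{B/A}; inverting A flips the sign.
    group_delay = -(B / A).real
    return magnitude, group_delay


def resonator(r, theta):
    """Denominator of a single formant-like resonance (hypothetical helper).

    Conjugate pole pair at radius r and angle theta:
    A(z) = 1 - 2 r cos(theta) z^-1 + r^2 z^-2.
    """
    return [1.0, -2.0 * r * math.cos(theta), r * r]


if __name__ == "__main__":
    a = resonator(0.95, math.pi / 4)  # illustrative values, not from the paper
    for i in range(9):
        w = i * math.pi / 8
        mag, gd = allpole_response(a, w)
        print(f"omega={w:.3f}  |H|={mag:8.3f}  group delay={gd:8.3f} samples")
```

In this toy case the group delay peaks at the same frequency as the magnitude resonance and varies strongly with frequency, which is the coupling between magnitude and phase that the abstract refers to.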

Bibliographic record

Source: Audio Engineering Society international convention, 2018, pp. 295-309 (15 pages)
Author: Anibal J. Ferreira
Original format: PDF
Date indexed: 2022-08-26 14:33:57

Similar literature

1. Vocal-tract modeling: fractional elongation of segment lengths in a waveguide model with half-sample delays [J]. Mathur S., Story B.H., Rodriguez J.J. IEEE transactions on audio, speech and language processing. 2006, No. 5
2. On the ability of a physiologically constrained area function model of the vocal tract to produce normal formant patterns under perturbed conditions [J]. Brad H. Story. The Journal of the Acoustical Society of America. 2004, No. 4
3. Modeling individual vocal differences in group-living lemurs using vocal tract morphology [J]. Marco Gamba, Livio Favaro, Alessandro Araldi. Current zoology. 2017, No. 4
4. On the physiological validity of the group delay response of all-pole vocal tract modeling [C]. Anibal J. Ferreira. Audio Engineering Society Convention. 2018
5. Post-exercise responses during treatment delays do not affect the physiological responses to cooling in cold water in hyperthermic individuals [D]. Carlson, Mark. 2013
6. Modeling individual vocal differences in group-living lemurs using vocal tract morphology [O]. Marco Gamba, Livio Favaro, Alessandro Araldi. 2017
7. Group Delay Function from All-Pole Models for Musical Instrument Recognition [O]. Aleksandr Diment, Padmanabhan Rajan, Toni Heittola. 2014
8. The Identifiability of Approximate Vocal Tract Impulse Response Magnitudes [R]. McKendree, F. S. 1977
