Conference: Audio Engineering Society international convention

On the physiological validity of the group delay response of all-pole vocal tract modeling

Abstract

Magnitude-oriented approaches dominate the voice analysis front-ends of most current technologies for tasks such as speaker identification, speech coding and compression, and voice reconstruction and re-synthesis. A popular technique is all-pole vocal tract modeling. The phase response of all-pole models is known to be non-linear and highly dependent on the magnitude frequency response. In this paper, we use a shift-invariant, phase-related feature estimated from signal harmonics to study the impact of all-pole models on the phase structure of voiced sounds. We compare that impact with the phase structure found in natural voiced sounds in order to draw conclusions about the physiological validity of the group delay of all-pole vocal tract modeling. Our findings emphasize that harmonic phase models are idiosyncratic, which is important for speaker identification and for improving the quality and naturalness of synthetic and reconstructed speech.
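The coupling between magnitude and phase mentioned in the abstract can be illustrated with a small numerical sketch. A stable all-pole filter H(z) = 1/A(z) is minimum-phase, so its group delay is tied to its magnitude response: sharpening a resonance raises both the magnitude peak and the group delay peak. The code below is only an illustration under stated assumptions and is not the shift-invariant harmonic feature used in the paper. The function names, the 500 Hz resonance, the bandwidths, the 16 kHz sampling rate, and the bandwidth-to-radius approximation r = exp(-pi * bw / fs) are all hypothetical choices made for this example.

```python
import numpy as np


def allpole_from_resonances(freqs_hz, bandwidths_hz, fs):
    """Build the denominator A(z) of an all-pole filter H(z) = 1/A(z).

    Each resonance becomes one complex-conjugate pole pair. The pole
    radius uses the common approximation r = exp(-pi * bw / fs)
    (an assumption of this sketch, not taken from the paper).
    """
    a = np.array([1.0])
    for f, bw in zip(freqs_hz, bandwidths_hz):
        r = np.exp(-np.pi * bw / fs)
        theta = 2.0 * np.pi * f / fs
        # (1 - r e^{j theta} z^-1)(1 - r e^{-j theta} z^-1)
        a = np.convolve(a, [1.0, -2.0 * r * np.cos(theta), r * r])
    return a


def allpole_response(a, n_fft=4096):
    """Return (magnitude in dB, group delay in samples) of H(z) = 1/A(z).

    With A(w) = sum_n a[n] e^{-jwn} and B(w) = sum_n n a[n] e^{-jwn},
    the group delay of the FIR polynomial A is Re{B/A}; inverting the
    filter flips the sign of the phase, so H has group delay -Re{B/A}.
    """
    n = np.arange(len(a))
    A = np.fft.rfft(a, n_fft)
    B = np.fft.rfft(n * a, n_fft)
    magnitude_db = -20.0 * np.log10(np.abs(A))
    group_delay = -np.real(B / A)
    return magnitude_db, group_delay


if __name__ == "__main__":
    fs = 16000  # hypothetical sampling rate in Hz
    for bw in (50.0, 200.0):  # hypothetical resonance bandwidths in Hz
        a = allpole_from_resonances([500.0], [bw], fs)
        mag_db, gd = allpole_response(a)
        k = int(np.argmax(gd))
        print(f"bandwidth {bw:5.0f} Hz: "
              f"peak magnitude {mag_db.max():6.2f} dB, "
              f"peak group delay {gd.max():7.2f} samples "
              f"near {k * fs / 4096:.1f} Hz")
```

Running the script prints the peak magnitude and peak group delay for a narrow and a wide resonance at the same frequency, which makes the dependence of the phase-derived quantity on the magnitude shape visible without any claim about how natural voiced sounds behave.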
