首页> 外文会议>Audio Engineering Society Convention >On the physiological validity of the group delay response of all-pole vocal tract modeling
【24h】

On the physiological validity of the group delay response of all-pole vocal tract modeling

机译:关于全极声道建模的群体延迟响应的生理有效性

获取原文

摘要

Magnitude-oriented approaches dominate the voice analysis front-ends of most current technologies addressing e.g. speaker identification, speech coding/compression, voice reconstruction and re-synthesis. A popular technique is all-pole vocal tract modeling. The phase response of all-pole models is known to be non-linear and highly dependent on the magnitude frequency response. In this paper, we use a shift-invariant phase-related feature that is estimated from signal harmonics in order to study the impact of all-pole models on the phase structure of voiced sounds. We relate that impact to the phase structure that is found in natural voiced sounds to conclude on the physiological validity of the group delay of all-pole vocal tract modeling. Our findings emphasize that harmonic phase models are idiosyncratic, and this is important in speaker identification, and in fostering the quality and naturalness of synthetic and reconstructed speech.
机译:面向幅度的方法主导语音分析大多数当前技术的正端。扬声器识别,语音编码/压缩,语音重建和重新合成。一种流行的技术是全极声道模拟。已知全极模型的相位响应是非线性的,高度依赖于幅度频率响应。在本文中,我们使用从信号谐波估计的移位不变相位相关的特征,以便研究全极模型对浊音的相位结构的影响。我们涉及对天然浊音的相位结构的影响,以得出全极声道模拟集团延迟的生理有效性。我们的调查结果强调,谐波相模型是特殊的,这在扬声器识别方面很重要,促进合成和重建语音的质量和自然。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号