...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Dynamic Speech Spectrum Representation and Tracking Variable Number of Vocal Tract Resonance Frequencies With Time-Varying Dirichlet Process Mixture Models
【24h】

Dynamic Speech Spectrum Representation and Tracking Variable Number of Vocal Tract Resonance Frequencies With Time-Varying Dirichlet Process Mixture Models

机译:时变Dirichlet过程混合模型的动态语音频谱表示和声道共振频率的可变数目跟踪

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a new approach for dynamic speech spectrum representation and tracking vocal tract resonance (VTR) frequencies. The method involves representing the spectral density of the speech signals as a mixture of Gaussians with unknown number of components for which time-varying Dirichlet process mixture model (DPM) is utilized. In the resulting representation, the number of formants is allowed to vary in time. The paper first presents an analysis on the continuity of the formants in the spectrum during the speech utterance. The analysis is based on a new state space representation of concatenated tube model. We show that the number of formants which appear in the spectrum is directly related to the location of the constriction of the vocal tract (i.e., the location of the excitation). Moreover, the disappearance of the formants in the spectrum is explained by ldquouncontrollable modesrdquo of the state space model. Under the assumption of existence of varying number of formants in the spectrum, we propose the use of a DPM model based multi-target tracking algorithm for tracking unknown number of formants. The tracking algorithm defines a hierarchical Bayesian model for the unknown formant states and the inference is done via Rao-Blackwellized particle filter.
机译:在本文中,我们提出了一种用于动态语音频谱表示和跟踪声道共振(VTR)频率的新方法。该方法涉及将语音信号的频谱密度表示为具有未知数量的分量的高斯混合信号,为此使用时变Dirichlet过程混合模型(DPM)。在结果表示中,共振峰的数量允许随时间变化。本文首先对语音发声期间共振峰在频谱中的连续性进行了分析。该分析基于级联管模型的新状态空间表示。我们表明,出现在频谱中的共振峰的数量与声带收缩的位置(即激发的位置)直接相关。此外,共振峰在频谱中的消失是由状态空间模型的“不可控制的模式”来解释的。在频谱中存在数量不等的共振峰的假设下,我们建议使用基于DPM模型的多目标跟踪算法来跟踪未知数目的共振峰。跟踪算法为未知共振峰状态定义了分层贝叶斯模型,并通过Rao-Blackwellized粒子滤波器进行推理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号