首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >DECOMPOSITION OF SPEECH INTO VOICED AND UNVOICED COMPONENTS BASED ON A STATE-SPACE SIGNAL MODEL
【24h】

DECOMPOSITION OF SPEECH INTO VOICED AND UNVOICED COMPONENTS BASED ON A STATE-SPACE SIGNAL MODEL

机译:基于状态空间信号模型将语音分解成浊音和清音组件

获取原文

摘要

We present a novel method for decomposing speech into voiced and unvoiced components. After demodulating variations in spectral envelope, energy and pitch, the method involves applying a bank of Kalman filters to separate the harmonic and non-harmonic components of the signal. This approach relies on a state-space representation of the composite signal, and provides a way to accurately estimate the harmonic component without the large delay required by a linear phase comb filter. However it also requires prior knowledge of the variance of the unvoiced component and the state transition parameters. We present a novel method to accurately determine these parameters based on a variant of the Expectation-Maximization algorithm. Modifications for dealing with unvoiced segments and voicing onset are also described.
机译:我们提出了一种将语音分解成浊音和清音组件的新方法。在解调光谱包络,能量和间距的变化之后,该方法涉及应用卡尔曼滤波器的银行来分离信号的谐波和非谐波分量。该方法依赖于复合信号的状态表示,并且提供了一种方法来准确地估计谐波分量,而不通过线相梳滤波器所需的大延迟。然而,它还需要先验知识了解无声组件的方差和状态转换参数。我们提出了一种基于期望最大化算法的变体来准确地确定这些参数的新方法。还描述了处理解释段和发作发作的修改。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号