首页> 外文期刊>Journal of Intelligent Systems >Time-Order Representation Based Method for Epoch Detection from Speech Signals
【24h】

Time-Order Representation Based Method for Epoch Detection from Speech Signals

机译:基于时间顺序表示的语音信号历元检测方法

获取原文
获取原文并翻译 | 示例
       

摘要

Epochs present in the voiced speech are defined as time instants of significant excitation of the vocal tract system during the production of speech. Nonstationary nature of excitation source and vocal tract system makes accurate identification of epochs a difficult task. Most of the existing methods for epoch detection require prior knowledge of voiced regions and a rough estimation of pitch frequency. In this paper, we propose a novel method that relies on time-order representation (TOR) based on short-time Fourier-Bessel (FB) series expansion which can be employed on entire speech signal to detect epochs without any prior information. The proposed method automatically detects voiced regions in the speech signal by computing the marginal energy density with respect to time in the low frequency range (LFR) from the energy distribution in the time-frequency plane. An estimate of pitch frequency for each detected voiced region is then obtained by computing the marginal energy density with respect to frequency in the LFR from the energy distribution in the time-frequency plane. Epochs are located for each detected voiced region as peaks in the derivative of the low pass filtered (LPF) signal corresponding to falling edges of peak negative cycles in the LPF signal synthesized from TOR coefficients corresponding to LFR. Experimental results obtained by the proposed method on speech signals taken from the CMU-Arctic database are found to be promising. The proposed method detects epochs with high accuracy and reliability.
机译:语音中出现的时期被定义为在语音产生过程中声道系统明显激发的时刻。激励源和声道系统的非平稳性质使准确识别历元成为一项艰巨的任务。用于历元检测的大多数现有方法需要对浊音区域的先验知识和音调频率的粗略估计。在本文中,我们提出了一种基于短时傅立叶-贝塞尔(FB)级数扩展的基于时间顺序表示(TOR)的新颖方法,该方法可用于整个语音信号以检测历元而无需任何先验信息。所提出的方法通过根据时频平面中的能量分布计算低频范围(LFR)中相对于时间的边际能量密度,从而自动检测语音信号中的浊音区域。然后,通过从时频平面中的能量分布计算相对于LFR中频率的边际能量密度,可以获得每个检测到的浊音区域的基音频率估计。针对每个检测到的浊音区域定位历元,作为低通滤波(LPF)信号的导数中的峰值,该峰值对应于由对应于LFR的TOR系数合成的LPF信号中的峰值负周期的下降沿。发现该方法对从CMU-Arctic数据库获取的语音信号的实验结果很有希望。所提出的方法以高精度和可靠性来检测历元。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号