首页> 外文会议>IEEE International Conference on Engineering Technologies and Applied Sciences >Estimation of Fundamental Frequency of Noisy Speech Signals using Correlogram based on Subband Filtering
【24h】

Estimation of Fundamental Frequency of Noisy Speech Signals using Correlogram based on Subband Filtering

机译:基于子带滤波的相关图估计噪声语音信号的基本频率

获取原文

摘要

Estimation of the fundamental frequency (FO) is a difficult task under the colored noise environment. In this work, an algorithm for estimation of the fundamental frequency of speech signal is proposed based on multi-channel comb filtering and correlogram for noisy speech signals. The noisy speech signal is decomposed into four bands using wavelet transform. The comb filter is applied on each decomposed subband signal to enhance the peaks in each correlogram. For obtaining the correlograms, the autocorrelation is computed for each subband. The comb filter is defined using the raised cosine function with interpeak frequency ranges from 25 to 500 Hz. Comb filter channels are needed to match the harmonics of the fundamental frequency and these channels are selected based on the harmonic to subharmonic ratio. The proposed method is evaluated using the CSTR database. The performance of the proposed method is compared with the RAPT, YIN, Cepstrum and SHR algorithms in terms of gross pitch error (GPE). The simulation results show the performance of the proposed algorithm is superior in different noise environments compared to existing methods.
机译:在有色噪声环境下,基频(FO)的估算是一项艰巨的任务。在这项工作中,提出了一种基于多通道梳状滤波和相关图的语音信号基频估计算法。使用小波变换将嘈杂的语音信号分解为四个频带。梳状滤波器应用于每个分解后的子带信号,以增强每个相关图中的峰值。为了获得相关图,为每个子带计算自相关。梳状滤波器是使用提升的余弦函数定义的,其峰间频率范围为25至500 Hz。需要梳状滤波器通道来匹配基本频率的谐波,并且这些通道是根据谐波与次谐波的比率来选择的。所提出的方法是使用CSTR数据库进行评估的。在总音调误差(GPE)方面,将所提方法的性能与RAPT,YIN,倒谱和SHR算法进行了比较。仿真结果表明,与现有方法相比,该算法在不同噪声环境下的性能优越。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号