...
首页> 外文期刊>Signal processing >Speech enhancement using a wavelet thresholding method based on symmetric Kullback-Leibler divergence
【24h】

Speech enhancement using a wavelet thresholding method based on symmetric Kullback-Leibler divergence

机译:基于对称Kullback-Leibler发散的小波阈值化语音增强

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Performance of wavelet thresholding methods for speech enhancement strongly depends on estimating an exact threshold value in the wavelet sub-bands. In this paper, we propose a new method for more exact estimation of the threshold value. Our proposed threshold value is firstly obtained based on the symmetric Kullback-Leibler divergence between the probability distributions of noisy speech and noise wavelet coefficients. In the next step, we improved this value using the segmental Signal-to-Noise Ratio (SNR). We used some TIMIT utterances to assess the performance of the proposed threshold. The algorithm is evaluated using the Perceptual Evaluation of Speech Quality (PESQ) score and the SNR improvement in ideal and real modes. In ideal and real modes, on average, we obtain respectively 2.25 dB and 1 dB SNR improvement and a PESQ score increase up to 1.1, 0.75 compared with the conventional wavelet thresholding approaches. In comparison to the adaptive thresholding approach, on average in ideal and real modes, we obtain respectively 1.6 dB and 0.9 dB SNR improvement The PESQ value of the adaptive thresholding method, in the real and ideal modes, is 025 higher and 0.5 lower than that of our proposed method, respectively.
机译:用于语音增强的小波阈值方法的性能在很大程度上取决于估计小波子带中的确切阈值。在本文中,我们提出了一种用于更精确地估计阈值的新方法。首先,基于噪声语音的概率分布与噪声小波系数之间的对称Kullback-Leibler散度来获得我们提出的阈值。在下一步中,我们使用分段信噪比(SNR)改善了该值。我们使用了一些TIMIT话语来评估建议阈值的性能。使用语音质量的感知评估(PESQ)分数和理想模式和实模式下的SNR改善对算法进行评估。与传统的小波阈值方法相比,在理想模式和实模式下,平均而言,我们分别获得了2.25 dB和1 dB的SNR改善,并且PESQ得分提高了1.1、0.75。与自适应阈值方法相比,理想模式和实模式下的平均SNR改善分别为1.6 dB和0.9 dB。在实模式和理想模式下,自适应阈值方法的PESQ值分别比实模式和理想模式高025和低0.5。我们提出的方法分别。

著录项

  • 来源
    《Signal processing》 |2015年第1期|184-197|共14页
  • 作者单位

    Audio & Speech Processing Lab, Computer Engineering Department, Iran University of Science & Technology, Tehran, Iran,Astronautics Research Institute, Iranian Space Research Center, Aerospace Research Center lane, Mahestan Street, Iran zamin Street, 14665-834, Tehran, Iran;

    Audio & Speech Processing Lab, Computer Engineering Department, Iran University of Science & Technology, Tehran, Iran;

    Audio & Speech Processing Lab, Computer Engineering Department, Iran University of Science & Technology, Tehran, Iran,Electrical and Computer Engineering Department, K.N.Toosi University of Technology, Tehran, Iran;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Speech enhancement; Wavelet thresholding; Kullback-Leibler divergence; Probability distribution;

    机译:语音增强;小波阈值;Kullback-Leibler分歧;概率分布;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号