首页> 外文期刊>Applied Acoustics >Speech enhancement based on the discrete Gabor transform and multi-notch adaptive digital filters
【24h】

Speech enhancement based on the discrete Gabor transform and multi-notch adaptive digital filters

机译:基于离散Gabor变换和多陷波自适应数字滤波器的语音增强

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents a new method to speech enhancement based on time-frequency analysis and adaptive digital filtering. The proposed method for dual-channel speech enhancement was developed by tracking frequencies of corrupting signal by the discrete Gabor transform (DGT) and implementing multi-notch adaptive digital filter (MNADF) at those frequencies. Since no a priori knowledge of the noise source statistics is required this method differs from traditional speech enhancement methods. Specifically, the proposed method was applied to the case where speech quality and intelligibility deteriorate in the presence of background noise. Speech coders and automatic speech recognition (ASR) systems are designed to act on clean speech signals. Therefore, corrupted speech signals by the noise must be enhanced before their processing. The method uses a primary input containing the corrupted speech signal while a reference input containing the noise only. In this paper, we designed MNADF instead of single-notch adaptive digital filter and used DGT to track frequencies of corrupting signal because fast filtering process and fast measure of the time-dependent noise frequency are of great importance in speech enhancement process. Therefore, MNADF was implemented to take advantage of fast filtering process. Different types of noises from Noisex-92 database were used to degrade real speech signals. Objective measures, the study of the speech spectrograms and global signal-to-noise ratio (SNR), segmental SNR (segSNR), Itakura-Saito distance measure as well as subjective listing test demonstrated consistently superior enhancement performance of the proposed method over traditional speech enhancement method such as spectral subtraction. Combining MNADF and DGT, excellent speech enhancement was obtained.
机译:本文提出了一种基于时频分析和自适应数字滤波的语音增强新方法。通过使用离散Gabor变换(DGT)跟踪损坏信号的频率并在这些频率上实现多陷波自适应数字滤波器(MNADF),开发了提出的双通道语音增强方法。由于不需要噪声源统计的先验知识,因此该方法不同于传统的语音增强方法。具体而言,将所提出的方法应用于在存在背景噪声的情况下语音质量和清晰度下降的情况。语音编码器和自动语音识别(ASR)系统设计为对纯净语音信号起作用。因此,在处理之前,必须增强被噪声破坏的语音信号。该方法使用包含损坏的语音信号的主要输入,而仅包含噪声的参考输入。本文中,我们设计了MNADF而不是单陷波自适应数字滤波器,并使用DGT来跟踪失真信号的频率,因为快速滤波过程和快速测量随时间变化的噪声频率在语音增强过程中非常重要。因此,实施MNADF是为了利用快速过滤过程。使用来自Noisex-92数据库的不同类型的噪声来降低真实语音信号的质量。客观测量,语音频谱图和整体信噪比(SNR),分段SNR(segSNR),Itakura-Saito距离测量以及主观列表测试的研究表明,该方法始终具有优于传统语音的增强性能光谱减法等增强方法。结合MNADF和DGT,获得了出色的语音增强效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号