首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Time-domain approach using multiple Kalman filters and EM algorithmto speech enhancement with nonstationary noise
【24h】

Time-domain approach using multiple Kalman filters and EM algorithmto speech enhancement with nonstationary noise

机译:使用多个卡尔曼滤波器和EM算法的时域方法用于非平稳噪声的语音增强

获取原文
获取原文并翻译 | 示例
           

摘要

A time-domain approach for enhancing speech signals degraded by statistically independent additive nonstationary noise with no a priori information is developed. The autoregressive (AR)-hidden filter model (HFM) with gain contour is proposed for modeling the statistical characteristics of the clean speech signal. Given the HFM parameter set of the speech, speech enhancement becomes a set of problems of joint signal estimation for clean speech and system identification for the gain contour and time-varying parameter of noise. Then, the expectation-maximization (EM) algorithm is applied to signal estimation and system identification. In the E-step, the signal estimation becomes a weighted sum of conditional mean estimator using multiple Kalman filters with Markovian switching coefficient, where the weights equal to a posteriori probabilities of the specific state sequence history given the noisy speech. The probability is computed by the Viterbi algorithm (VA). In M-step, the gain contour and noise parameters are recursively updated by an adaptive algorithm modified from the gradient-based algorithm. The proposed method does not require framing of speech signal in, the train and enhancement procedure. The proposed method is tested against the noisy speech signals degraded by nonstationary noise at various input signal-to-noise ratios. An approximate improvement of 4.5-6.0 dB in signal-to-noise ratio (SNR) is achieved at the input SNR 10 and 15 dB
机译:开发了一种时域方法,用于增强没有统计信息的统计独立的加性非平稳噪声导致的语音信号退化。提出了具有增益轮廓的自回归(AR)隐藏滤波器模型(HFM),用于对干净语音信号的统计特性进行建模。给定语音的HFM参数集,语音增强成为清洁语音的联合信号估计以及噪声的增益轮廓和时变参数的系统识别的一系列问题。然后,将期望最大化算法应用于信号估计和系统识别。在E步中,使用具有马尔可夫切换系数的多个卡尔曼滤波器,信号估计成为条件均值估计器的加权和,其中权重等于在给定语音噪声的情况下特定状态序列历史的后验概率。概率由维特比算法(VA)计算。在M步中,通过从基于梯度的算法修改而来的自适应算法来递归更新增益轮廓和噪声参数。所提出的方法不需要在训练和增强过程中对语音信号进行成帧。针对各种输入信噪比下的非平稳噪声导致的有声语音信号进行了测试。在输入SNR 10和15 dB时,信噪比(SNR)大约提高了4.5-6.0 dB

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号