首页> 外文会议>International Conference on Signal Processing and Communications >Global soft decision based speech enhancement using voiced-unvoiced uncertainty and harmonic phase decomposition technique
【24h】

Global soft decision based speech enhancement using voiced-unvoiced uncertainty and harmonic phase decomposition technique

机译:使用浊音不确定性和谐波相位分解技术的基于全局软判决的语音增强

获取原文

摘要

This paper introduces a single-channel speech enhancement framework based on global soft-decision, where the magnitude spectrum of clean speech is estimated by combining two separate Bayesian estimators based on voiced-unvoiced uncertainty. For the voiced regions, the perceptually motivated adaptive β-order weighted minimum mean square error (MMSE) estimator is employed. On the other hand, for the unvoiced segments, Bayesian estimator derived from modified Itakura-Saito (MIS) cost function is utilized. In addition to this spectral magnitude enhancement, the clean speech is reconstructed in time domain using the estimated clean phase based on a technique involving harmonic phase decomposition and spectro-temporal smoothing filters. The proposed algorithm is simulated under different non-stationary noisy environments at various signal to noise ratio values. The experimental results show that in contrast to other benchmark methods, the proposed speech enhancement method produces enhanced speech signal with negligible musical noise and a joint improvement in both perceived quality and speech intelligibility.
机译:本文介绍了一种基于全局软决策的单通道语音增强框架,该框架通过结合两个基于清浊不确定性的贝叶斯估计量来估计纯净语音的幅度谱。对于发声区域,采用感知动机的自适应β阶加权最小均方误差(MMSE)估计器。另一方面,对于清音段,使用了从修改后的Itakura-Saito(MIS)成本函数得出的贝叶斯估计量。除了这种频谱幅度增强之外,基于包括谐波相位分解和频谱时域平滑滤波器在内的技术,使用估计的干净相位在时域中重构干净的语音。在不同的非平稳噪声环境下,以不同的信噪比值对算法进行了仿真。实验结果表明,与其他基准方法相比,所提出的语音增强方法可以产生增强的语音信号,而音乐噪声却可以忽略不计,并且在感知质量和语音清晰度方面都有共同的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号