Journal: ETRI Journal

Two-Microphone Binary Mask Speech Enhancement in Diffuse and Directional Noise Fields


Abstract

Two-microphone binary mask speech enhancement (2mBMSE) has attracted particular interest in recent literature and has shown promising results. Current 2mBMSE systems rely on spatial cues of the speech and noise sources. Although these cues are helpful for directional noise sources, they lose their effectiveness in diffuse noise fields. We propose a new system that is effective in both directional and diffuse noise conditions. The system exploits two features. The first determines whether a given time-frequency (T-F) unit of the input spectrum is dominated by a diffuse or a directional source. A diffuse signal is certainly noise, but a directional signal could correspond to either a noise source or a speech source. The second feature discriminates between T-F units dominated by speech and those dominated by directional noise. Speech enhancement is performed using a binary mask calculated from the proposed features. In both directional and diffuse noise fields, the proposed system segregates speech T-F units with hit rates above 85%. It outperforms previous solutions in terms of signal-to-noise ratio and perceptual evaluation of speech quality (PESQ) improvement, especially in diffuse noise conditions.
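The two-stage decision described in the abstract can be illustrated with a minimal sketch. The abstract does not name the two features, so the ones used here are assumptions chosen for illustration only: magnitude-squared coherence between the two microphone spectra stands in for the diffuse-versus-directional feature, and the inter-channel phase of the smoothed cross-spectrum stands in for the speech-versus-directional-noise feature, under the further assumption that the target talker is broadside to the array (zero inter-microphone delay). The function names, thresholds, and smoothing constant are hypothetical and are not taken from the paper.

```python
import numpy as np

def smooth(x, alpha):
    """First-order recursive average along the frame axis (axis 1)."""
    out = np.empty_like(x)
    acc = x[:, 0]
    for t in range(x.shape[1]):
        acc = alpha * acc + (1.0 - alpha) * x[:, t]
        out[:, t] = acc
    return out

def two_feature_binary_mask(X1, X2, alpha=0.8, coh_thresh=0.7, phase_thresh=0.3):
    """Binary mask from two complex STFTs of shape (freq_bins, frames).

    Assumed stand-in features (not the paper's):
      1. magnitude-squared coherence: high means a directional source dominates
      2. cross-spectrum phase: near zero means the source is broadside (assumed speech)
    """
    p11 = smooth(np.abs(X1) ** 2, alpha)        # smoothed auto-spectrum, mic 1
    p22 = smooth(np.abs(X2) ** 2, alpha)        # smoothed auto-spectrum, mic 2
    p12 = smooth(X1 * np.conj(X2), alpha)       # smoothed cross-spectrum
    msc = np.abs(p12) ** 2 / (p11 * p22 + 1e-12)

    directional = msc > coh_thresh               # feature 1: diffuse units are rejected
    frontal = np.abs(np.angle(p12)) < phase_thresh  # feature 2: off-axis units are rejected
    return directional & frontal                 # keep only directional, on-axis units

def hit_rate(mask, ideal_mask):
    """Fraction of speech-dominated T-F units (per ideal_mask) that mask retains."""
    return (mask & ideal_mask).sum() / ideal_mask.sum()
```

The enhanced spectrum would then be obtained by multiplying one microphone's STFT by the mask element-wise before resynthesis. The hit rate reported in the abstract is measured against a reference (ideal) mask, which this sketch takes as an input rather than computing.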
