首页> 外文会议>International Congress on Acoustics >Comparison of ideal mask-based speech enhancement algorithms for white noise and low mixture signal-to-noise ratios
【24h】

Comparison of ideal mask-based speech enhancement algorithms for white noise and low mixture signal-to-noise ratios

机译:基于理想掩模语音增强算法的白噪声和低混合信噪比比的比较

获取原文

摘要

The intelligibility of noisy speech can be improved by applying an ideal binary or soft gain mask in the time-frequency domain for signal-to-noise ratios (SNRs) that are typically between -10 and +10 dB. In this study, two mask-based algorithms are compared when applied to speech mixed with white Gaussian noise (WGN) at low SNRs (from -29 to -5 dB). These comprise an Ideal Binary Mask (IBM) with a local criterion set to 0 dB and an Ideal Ratio Mask (IRM). The performance of Short-Time Objective Intelligibility (STOI), and a STOI variant (termed STOI+), is compared with that of other monaural intelligibility metrics that can be used before and after mask-based processing. The results show that IRMs can be used to obtain near maximal speech intelligibility (> 90% for sentence material) even at very low mixture SNRs, while IBMs with LC = 0 provide limited intelligibility gains for SNR < -14 dB. It is also shown that STOI+ is a suitable metric for speech mixed with WGN at low SNRs and processed by IBMs with LC = 0, even when the speech is high-pass filtered to flatten the spectral tilt.
机译:通过在时频域中应用理想的二进制或软增益掩模可以改善噪声语音的可理解性,以用于噪声比率(SNR),其通常在-10和+ 10dB之间。在该研究中,在将两个基于掩模的算法应用于与低SNR(从-29到-5 dB)应用于与白色高斯噪声(WGN)混合时进行比较。这些包括具有设置为0 dB的局部标准的理想二进制掩模(IBM)和理想比率掩模(IRM)。短时间客观智能性(STOI)和STOI变量(称为STOI +)的性能,与可以在基于掩模的处理之前和之后使用的其他单一的单一可懂度度量进行比较。结果表明,即使在非常低的混合SNR,IMM可用于获得近最大语音清晰度(句子材料的> 90%),而LC = 0的IBM为SNR <-14 dB提供有限的可懂度收益。还示出了STOI +是在低SNR下与WGN混合的语音的合适度量,并且通过LC = 0的IBM处理,即使语音高滤波以平坦倾斜浮动。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号