Journal: Signal Processing Letters, IEEE

Improving Speech Intelligibility in Noise Using a Binary Mask That Is Based on Magnitude Spectrum Constraints


Abstract

A new binary mask, based on magnitude spectrum constraints, is introduced for improving speech intelligibility. The proposed binary mask is designed to retain time-frequency (T-F) units of the mixture signal that satisfy a magnitude constraint while discarding T-F units that violate it. Motivated by prior intelligibility studies of speech synthesized using the ideal binary mask, an algorithm is proposed that decomposes the input signal into T-F units and uses a Bayesian classifier to make a binary decision as to whether each T-F unit satisfies the magnitude constraint. Speech corrupted by different types of maskers at low signal-to-noise ratio (SNR) levels (-5 and 0 dB) is synthesized by this algorithm and presented to normal-hearing listeners for identification. Results indicated substantial improvements in intelligibility over that attained by listeners with unprocessed stimuli.
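The retain-or-discard step described in the abstract can be illustrated with a minimal sketch. The abstract specifies neither the magnitude constraint nor the features and training of the Bayesian classifier, so the decision function below (`toy_decision`, a fixed threshold) is a purely hypothetical stand-in, and the names `apply_binary_mask` and `mixture` are likewise assumptions made for illustration. The sketch shows only how per-unit binary decisions turn a grid of T-F magnitudes into a masked grid.

```python
from typing import Callable, List

Matrix = List[List[float]]  # rows = time frames, columns = frequency bins


def apply_binary_mask(
    mixture_mag: Matrix,
    decide: Callable[[int, int, float], bool],
) -> Matrix:
    """Keep each T-F unit where decide(...) is True; zero it otherwise."""
    return [
        [mag if decide(t, f, mag) else 0.0 for f, mag in enumerate(frame)]
        for t, frame in enumerate(mixture_mag)
    ]


def toy_decision(t: int, f: int, mag: float) -> bool:
    # ASSUMPTION: placeholder for the classifier's per-unit decision.
    # The real constraint and classifier are not given in the abstract.
    return mag >= 0.5


# Made-up magnitudes for two frames and three frequency bins.
mixture = [[0.9, 0.1, 0.6], [0.2, 0.7, 0.4]]
masked = apply_binary_mask(mixture, toy_decision)
print(masked)  # [[0.9, 0.0, 0.6], [0.0, 0.7, 0.0]]
```

In a complete system the masked magnitudes would be recombined with the mixture phase and resynthesized into a waveform; that stage is omitted here because the abstract does not describe it.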
