Journal: The Journal of the Acoustical Society of America

Intelligibility of reverberant noisy speech with ideal binary masking


Abstract

For a mixture of target speech and noise in anechoic conditions, the ideal binary mask is defined as follows: it selects the time-frequency units where target energy exceeds noise energy by a certain local threshold and cancels the other units. In this study, the definition of the ideal binary mask is extended to reverberant conditions. Given the division between early and late reflections in terms of speech intelligibility, three ideal binary masks can be defined: an ideal binary mask that uses the direct path of the target as the desired signal, an ideal binary mask that uses the direct path and early reflections of the target as the desired signal, and an ideal binary mask that uses the reverberant target as the desired signal. The effects of these ideal binary mask definitions on speech intelligibility are compared across two types of interference: speech-shaped noise and concurrent female speech. As suggested by psychoacoustical studies, the ideal binary mask based on the direct path and early reflections of target speech outperforms the other masks as reverberation time increases and produces substantial reductions in terms of speech reception threshold for normal-hearing listeners.
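
The mask definition in the abstract can be illustrated with a minimal sketch. It assumes the time-frequency energies of the desired signal and of the interference are already available as arrays; the function name `ideal_binary_mask`, the parameter `lc_db`, and the toy values are hypothetical and not taken from the paper. Under this sketch, the three masks described above would differ only in which signal is passed as the desired one (direct path, direct path plus early reflections, or the full reverberant target). How the remaining energy is assigned to the interference is left open here, since the abstract does not say.

```python
import numpy as np

def ideal_binary_mask(desired_energy, interference_energy, lc_db=0.0):
    """Return a 0/1 mask over time-frequency units.

    A unit is kept (1) when the desired-signal energy exceeds the
    interference energy by more than lc_db decibels, and cancelled (0)
    otherwise. lc_db plays the role of the local threshold.
    """
    desired = np.asarray(desired_energy, dtype=float)
    interference = np.asarray(interference_energy, dtype=float)
    eps = np.finfo(float).tiny  # guards against log(0) and division by zero
    local_snr_db = 10.0 * np.log10((desired + eps) / (interference + eps))
    return (local_snr_db > lc_db).astype(np.uint8)

# Toy energies (rows: frequency channels, columns: time frames).
desired = np.array([[4.0, 1.0, 0.0],
                    [1.0, 9.0, 2.0]])
interference = np.array([[1.0, 1.0, 1.0],
                         [2.0, 1.0, 2.0]])

mask = ideal_binary_mask(desired, interference, lc_db=0.0)
print(mask)
```

With a 0 dB threshold, only the units where the desired energy is strictly larger than the interference energy are retained; lowering `lc_db` keeps more units.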
