Published in: Trends in Hearing

Modeling Sluggishness in Binaural Unmasking of Speech for Maskers With Time-Varying Interaural Phase Differences

Abstract

In studies investigating binaural processing in human listeners, relatively long and task-dependent time constants of a binaural window ranging from 10 ms to 250 ms have been observed. Such time constants are often thought to reflect "binaural sluggishness." In this study, the effect of binaural sluggishness on binaural unmasking of speech in stationary speech-shaped noise is investigated in 10 listeners with normal hearing. In order to design a masking signal with temporally varying binaural cues, the interaural phase difference of the noise was modulated sinusoidally with frequencies ranging from 0.25 Hz to 64 Hz. The lowest, that is the best, speech reception thresholds (SRTs) were observed for the lowest modulation frequency. SRTs increased with increasing modulation frequency up to 4 Hz. For higher modulation frequencies, SRTs remained constant in the range of 1 dB to 1.5 dB below the SRT determined in the diotic situation. The outcome of the experiment was simulated using a short-term binaural speech intelligibility model, which combines an equalization–cancellation (EC) model with the speech intelligibility index. This model segments the incoming signal into 23.2-ms time frames in order to predict release from masking in modulated noises. In order to predict the results from this study, the model required a further time constant applied to the EC mechanism representing binaural sluggishness. The best agreement with perceptual data was achieved using a temporal window of 200 ms in the EC mechanism.
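
As a rough illustration of why a 200 ms window could follow slow interaural phase modulation yet average out fast modulation, the sketch below computes how much a moving average attenuates a sinusoid at each modulation frequency. It is not the authors' model. It assumes a rectangular window (the abstract does not state the window shape) applied directly to the phase trajectory, which is a strong simplification of an EC mechanism. The names `window_gain`, `frames_per_window`, and `MOD_FREQS_HZ` are hypothetical, and the octave spacing of the modulation frequencies is an assumption; the abstract gives only the 0.25 Hz to 64 Hz range.

```python
import math

# Values taken from the abstract.
WINDOW_S = 0.200   # temporal window giving the best agreement with the data
FRAME_S = 0.0232   # short-term frame length of the model

# Assumption: octave-spaced modulation frequencies from 0.25 Hz to 64 Hz.
MOD_FREQS_HZ = [0.25 * 2 ** k for k in range(9)]


def window_gain(mod_freq_hz, window_s=WINDOW_S):
    """Amplitude gain of a rectangular moving average of length window_s
    applied to a sinusoid at mod_freq_hz: |sin(pi*f*T) / (pi*f*T)|."""
    x = math.pi * mod_freq_hz * window_s
    if x == 0.0:
        return 1.0  # a constant signal passes through unchanged
    return abs(math.sin(x) / x)


def frames_per_window(window_s=WINDOW_S, frame_s=FRAME_S):
    """Number of short-term frames spanned by the sluggishness window."""
    return window_s / frame_s


if __name__ == "__main__":
    print(f"frames per window: {frames_per_window():.1f}")
    for f in MOD_FREQS_HZ:
        print(f"{f:6.2f} Hz  residual modulation gain {window_gain(f):.3f}")
```

Under these assumptions the gain is close to 1 at 0.25 Hz and small at 4 Hz and above. This is qualitatively in line with the reported SRT pattern, but the sketch predicts no SRT values.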