首页> 美国卫生研究院文献>Trends in Hearing >A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments
【2h】

A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments

机译:用于预测多通话者环境中语音清晰度的双耳分组模型

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Spatially separating speech maskers from target speech often leads to a large intelligibility improvement. Modeling this phenomenon has long been of interest to binaural-hearing researchers for uncovering brain mechanisms and for improving signal-processing algorithms in hearing-assistive devices. Much of the previous binaural modeling work focused on the unmasking enabled by binaural cues at the periphery, and little quantitative modeling has been directed toward the grouping or source-separation benefits of binaural processing. In this article, we propose a binaural model that focuses on grouping, specifically on the selection of time-frequency units that are dominated by signals from the direction of the target. The proposed model uses Equalization-Cancellation (EC) processing with a binary decision rule to estimate a time-frequency binary mask. EC processing is carried out to cancel the target signal and the energy change between the EC input and output is used as a feature that reflects target dominance in each time-frequency unit. The processing in the proposed model requires little computational resources and is straightforward to implement. In combination with the Coherence-based Speech Intelligibility Index, the model is applied to predict the speech intelligibility data measured by Marrone et al. The predicted speech reception threshold matches the pattern of the measured data well, even though the predicted intelligibility improvements relative to the colocated condition are larger than some of the measured data, which may reflect the lack of internal noise in this initial version of the model.
机译:在空间上将语音掩盖程序与目标语音隔离开通常会大大提高清晰度。长期以来,双耳听力研究人员一直希望对这种现象进行建模,以发现其大脑机制并改善助听器中的信号处理算法。先前的双耳建模工作大部分都集中在外围双耳线索实现的隐藏上,很少有定量建模针对双耳处理的分组或源分离优势。在本文中,我们提出了一种双耳模型,该模型侧重于分组,特别是在选择时频单位时,该时频单位受来自目标方向的信号支配。所提出的模型使用带有二进制决策规则的均衡取消(EC)处理来估计时频二进制掩码。进行EC处理以消除目标信号,并将EC输入和输出之间的能量变化用作反映每个时频单位中目标优势的功能。提出的模型中的处理需要很少的计算资源,并且易于实现。结合基于一致性的语音清晰度指数,该模型可用于预测由Marrone等人测量的语音清晰度数据。即使相对于共置条件的预测清晰度改善程度大于某些测量数据,预测语音接收阈值也很好地匹配了测量数据的模式,这可能反映了该模型的初始版本中内部噪声的缺乏。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号