首页> 外文OA文献 >Predicting phoneme and word recognition in noise using a computational model of the auditory periphery
【2h】

Predicting phoneme and word recognition in noise using a computational model of the auditory periphery

机译:使用听觉外围的计算模型预测噪声中的音素和单词识别

摘要

Several filterbank-based metrics have been proposed to predict speech intelligibility (SI). However, these metrics incorporate little knowledge of the auditory periphery. Neurogram-based metrics provide an alternative, incorporating knowledge of the physiology of hearing by using a mathematical model of the auditory nerve response. In this work, SI was assessed utilizing different filterbank-based metrics (the speech intelligibility index and the speech-based envelope power spectrum model) and neurogram-based metrics, using the biologically inspired model of the auditory nerve proposed by Zilany, Bruce, Nelson, and Carney [(2009), J. Acoust. Soc. Am. 126(5), 2390–2412] as a front-end and the neurogram similarity metric and spectro temporal modulation index as a back-end. Then, the correlations with behavioural scores were computed. Results showed that neurogram-based metrics representing the speech envelope showed higher correlations with thebehavioural scores at a word level. At a per-phoneme level, it was found that phoneme transitions contribute to higher correlations between objective measures that use speech envelope information at the auditory periphery level and behavioural data. The presented framework could function as a useful tool for the validation and tuning of speech materials, as well as a benchmark for the development of speech processing algorithms.
机译:已经提出了几种基于滤波器组的度量来预测语音清晰度(SI)。但是,这些度量标准很少包含听觉外围的知识。基于神经图的度量标准提供了一种替代方法,它通过使用听觉神经反应的数学模型来整合听力生理知识。在这项工作中,利用Zilany,Bruce,Nelson提出的听觉神经生物学模型,利用不同的基于滤波器组的指标(语音清晰度指数和基于语音的包络功率谱模型)和基于神经图的指标对SI进行了评估。和Carney [(2009),J。Acoust。 Soc。上午。 126(5),2390-2412]作为前端,神经图相似性度量和光谱时间调制指数作为后端。然后,计算与行为得分的相关性。结果表明,代表语音包络的基于神经图的度量标准与单词级别的行为得分具有更高的相关性。在每个音素级别,发现音素转换有助于在听觉外围级别使用语音包络信息的客观测量与行为数据之间的更高相关性。提出的框架可以用作语音材料的验证和调整的有用工具,以及语音处理算法开发的基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号