Journal: Trends in Hearing

Predicting Speech Perception in Older Listeners with Sensorineural Hearing Loss Using Automatic Speech Recognition


Abstract

The objective of this study was to provide proof of concept that the speech intelligibility in quiet of unaided older hearing-impaired (OHI) listeners can be predicted by automatic speech recognition (ASR). Twenty-four OHI listeners completed three speech-identification tasks using speech materials of varying linguistic complexity and predictability (i.e., logatoms, words, and sentences). An ASR system was first trained on different speech materials and then used to recognize the same speech stimuli presented to the listeners, but processed to mimic some of the perceptual consequences of age-related hearing loss experienced by each listener: the elevation of hearing thresholds (by linear filtering), the loss of frequency selectivity (by spectral smearing), and loudness recruitment (by raising the amplitude envelope to a power). Independently of the size of the lexicon used in the ASR system, strong to very strong correlations were observed between human and machine intelligibility scores. However, large root-mean-square errors (RMSEs) were observed for all conditions. The simulation of frequency selectivity loss had a negative impact on the strength of the correlation and the RMSE. The highest correlations and smallest RMSEs were found for logatoms, suggesting that the prediction system mostly reflects the functioning of the peripheral part of the auditory system. In the case of sentences, the prediction of human intelligibility was significantly improved by taking into account cognitive performance. This study demonstrates for the first time that ASR, even when trained on intact independent speech material, can be used to estimate trends in speech intelligibility of OHI listeners.
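
The abstract reports strong correlations between human and machine scores together with large RMSEs. The two are not contradictory: correlation measures whether the scores rise and fall together, while RMSE measures how far apart they are in absolute terms. The minimal sketch below illustrates this with invented percent-correct scores (they are not data from the study), and the helper names `pearson_r` and `rmse` are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rmse(x, y):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Invented percent-correct scores for five hypothetical listeners.
human = [90.0, 80.0, 70.0, 60.0, 50.0]
# Assumed machine scores: same ranking, but a constant 30-point underestimate.
machine = [h - 30.0 for h in human]

r = pearson_r(human, machine)
err = rmse(human, machine)
print(f"correlation = {r:.3f}, RMSE = {err:.1f} percentage points")
```

In this constructed case the machine tracks the trend across listeners perfectly, yet every individual prediction is off by 30 points. A pattern of this kind would be consistent with the abstract's conclusion that ASR can estimate trends in intelligibility.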
