Journal: Trends in Hearing

Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition


Abstract

Several studies have demonstrated that extended high frequencies (EHFs; > 8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral versus temporal information to speech recognition is unknown. Here, we show that access to EHF temporal information improved speech-in-speech recognition relative to speech bandlimited at 8 kHz but that additional access to EHF spectral detail provided an additional small but significant benefit. Results suggest that both EHF spectral structure and the temporal envelope contribute to the observed EHF benefit. Speech recognition performance was quite sensitive to masker head orientation, with a rotation of only 15° providing a highly significant benefit. An exploratory analysis indicated that pure-tone thresholds at EHFs are better predictors of speech recognition performance than low-frequency pure-tone thresholds.