Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition

Allison Trine; Brian B. Monson


Abstract

Several studies have demonstrated that extended high frequencies (EHFs; >8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral versus temporal information to speech recognition is unknown. Here, we show that access to EHF temporal information improved speech-in-speech recognition relative to speech bandlimited at 8 kHz but that additional access to EHF spectral detail provided an additional small but significant benefit. Results suggest that both EHF spectral structure and the temporal envelope contribute to the observed EHF benefit. Speech recognition performance was quite sensitive to masker head orientation, with a rotation of only 15° providing a highly significant benefit. An exploratory analysis indicated that pure-tone thresholds at EHFs are better predictors of speech recognition performance than low-frequency pure-tone thresholds.

Bibliographic details

Source
Trends in Hearing | 2020, Issue 1 | 8 pages
Authors
Allison Trine; Brian B. Monson
Original format
PDF
Keywords
head orientation; speech in noise; speech perception
