Proceedings of the Royal Society B: Biological Sciences

The intelligibility of noise-vocoded speech: spectral information available from across-channel comparison of amplitude envelopes



Abstract

Noise-vocoded (NV) speech is often regarded as conveying phonetic information primarily through temporal-envelope cues rather than spectral cues. However, listeners may infer the formant frequencies in the vocal-tract output—a key source of phonetic detail—from across-band differences in amplitude when speech is processed through a small number of channels. The potential utility of this spectral information was assessed for NV speech created by filtering sentences into six frequency bands, and using the amplitude envelope of each band (≤30 Hz) to modulate a matched noise-band carrier (N). Bands were paired, corresponding to F1 (≈N1 + N2), F2 (≈N3 + N4) and the higher formants (F3′ ≈ N5 + N6), such that the frequency contour of each formant was implied by variations in relative amplitude between bands within the corresponding pair. Three-formant analogues (F0 = 150 Hz) of the NV stimuli were synthesized using frame-by-frame reconstruction of the frequency and amplitude of each formant. These analogues were less intelligible than the NV stimuli or analogues created using contours extracted from spectrograms of the original sentences, but more intelligible than when the frequency contours were replaced with constant (mean) values. Across-band comparisons of amplitude envelopes in NV speech can provide phonetically important information about the frequency contours of the underlying formants.
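The idea that a formant's frequency contour is implied by the relative amplitude of two paired bands can be illustrated with a small sketch. The abstract does not state the reconstruction rule, so the amplitude-weighted mean of the two band centre frequencies used here is an assumption, as are the function name `implied_formant_track`, the summed amplitude, and the example centre frequencies of 300 Hz and 700 Hz.

```python
from typing import List, Sequence, Tuple


def implied_formant_track(
    env_lo: Sequence[float],
    env_hi: Sequence[float],
    fc_lo: float,
    fc_hi: float,
) -> List[Tuple[float, float]]:
    """Return a per-frame (frequency_hz, amplitude) estimate for one formant.

    env_lo / env_hi are frame-by-frame amplitude envelopes of the lower and
    upper band of a pair (e.g. N1 and N2 for F1). fc_lo / fc_hi are the band
    centre frequencies in Hz. ASSUMPTION: the frequency is taken as the
    amplitude-weighted mean of the two centre frequencies; the source does
    not specify the rule that was actually used.
    """
    if len(env_lo) != len(env_hi):
        raise ValueError("both envelopes must have the same number of frames")

    track: List[Tuple[float, float]] = []
    for a_lo, a_hi in zip(env_lo, env_hi):
        total = a_lo + a_hi
        if total <= 0.0:
            # Silent frame: nothing to compare, so fall back to the midpoint
            # of the pair and report zero amplitude.
            track.append(((fc_lo + fc_hi) / 2.0, 0.0))
            continue
        # More energy in the upper band pulls the estimate upwards.
        freq = (a_lo * fc_lo + a_hi * fc_hi) / total
        # ASSUMPTION: formant amplitude is the plain sum of the two envelopes.
        track.append((freq, total))
    return track


# Hypothetical F1 pair with centre frequencies 300 Hz and 700 Hz.
f1_track = implied_formant_track(
    [1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 1.0, 0.0], 300.0, 700.0
)
```

With this rule the estimate can only move between the two centre frequencies, so a pair of bands gives a coarse view of the underlying formant movement. That is in keeping with the finding that these analogues were less intelligible than those built from contours extracted from spectrograms of the original sentences.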
