首页> 外文期刊>Science >SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES
【24h】

SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES

机译:语音识别与主要时间提示

获取原文
获取原文并翻译 | 示例
       

摘要

Nearly perfect speech recognition was observed under conditions of greatly reduced spectral information. Temporal envelopes of speech were extracted from broad frequency bands and were used to modulate noises of the same bandwidths. This manipulation preserved temporal envelope cues in each band but restricted the listener to severely degraded information on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance was obtained with only three bands of modulated noise. Thus, the presentation of a dynamic temporal pattern in only a few broad spectral regions is sufficient for the recognition of speech.
机译:在频谱信息大大减少的情况下,观察到了近乎完美的语音识别。从宽频带中提取语音的时间包络,并用于调制相同带宽的噪声。这种操作保留了每个频带中的时间包络提示,但是限制了收听者获得关于频谱能量分布的严重退化的信息。随着乐队数量的增加,简单句子中的辅音,元音和单词的识别能力显着提高。仅在三个调制噪声带中获得了很高的语音识别性能。因此,仅在几个宽频谱区域中呈现动态时间模式就足以识别语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号