首页> 外文期刊>The Journal of the Acoustical Society of America >A model of auditory perception as front end for automatic speech recognition.
【24h】

A model of auditory perception as front end for automatic speech recognition.

机译:听觉感知模型作为自动语音识别的前端。

获取原文
获取原文并翻译 | 示例
           

摘要

A front end for automatic speech recognizers is proposed and evaluated which is based on a quantitative model of the "effective" peripheral auditory processing. The model simulates both spectral and temporal properties of sound processing in the auditory system which were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated word recognition experiments in different types of additive noise. The results show a higher robustness of the auditory front end in noise, compared to common mel-scale cepstral feature extraction. In a second set of experiments, different processing stages of the auditory front end were modified to study their contribution to robust speech signal representation in detail. The adaptive compression stage which enhances temporal changes of the input signal appeared to be the most important processing stage towards robust speech representation in noise. Low-pass filtering of the fast fluctuating envelope in each frequency band further reduces the influence of noise in the auditory-based representation of speech.
机译:提出并评估了自动语音识别器的前端,该前端基于“有效”外围听觉处理的定量模型。该模型模拟了在心理声学和生理学实验中发现的听觉系统中声音处理的频谱和时间特性。在不同类型的加性噪声​​中,在独立于说话者的独立单词识别实验中,评估了基于听觉的语音表示的鲁棒性。结果表明,与普通的梅尔尺度倒谱特征提取相比,听觉前端在噪声中具有更高的鲁棒性。在第二组实验中,对听觉前端的不同处理阶段进行了修改,以详细研究其对鲁棒语音信号表示的贡献。增强输入信号的时间变化的自适应压缩级似乎是在噪声中实现鲁棒语音表示的最重要的处理阶段。在每个频带中快速波动的包络线的低通滤波进一步降低了噪声在基于听觉的语音表示中的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号