首页> 外文会议>10th Western Pacific Acoustics Conference. >A psychoacoustically-motivated conceptual model for automatic speech recognition
【24h】

A psychoacoustically-motivated conceptual model for automatic speech recognition

机译:用于自动语音识别的心理听觉动机概念模型

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a conceptual model of human auditory perception processes for automatic speech recognition (ASR) systems. Even in complex noisy environments, humans can perceptually segregate the target sound that they apply attention to from an acoustic mixture. This paper names the primary advantage of human auditory perceptual process 慳ttentive ear.?We propose a psychoacoustically-motivated conceptual model of attentive ear, and introduce an ASR system as an application of our proposed model. Although psychoacoustically-motivated conceptual models based on human auditory processes for ASR system were proposed in decade, these are not as robust as the ASR systems using traditional noise reduction methods and adaptation methods based on statistical methodology. This is because properly, conceptual models based on human auditory processes were used as a preprocessor, connecting with a statistical recognizer in traditional ASR systems though a psychoacoustically-motivated conceptual model and a model based on statistical methodology in the ASR systems are di.erent theoretically. The ASR system based on our model is not a preprocessor for ASR system but an ASR system itself. To evaluate the ASR system based on our model, we carried out Japanese digit recognition experiments in six typical noisy environments. Results showed that our ASR system is more robust than traditional ones in experimental conditions of 0 dB SNR. These results suggest that our psychoacoustically-motivated conceptual model based on human auditory perceptual process, attentive ear, is e.ective for ASR, and robust in adverse noisy environment.
机译:本文介绍了用于自动语音识别(ASR)系统的人类听觉感知过程的概念模型。即使在复杂的嘈杂环境中,人类也可以在听觉上将注意力集中的目标声音与混合声音分离开来。本文提出了人类听觉感知过程“专注耳”的主要优势。我们提出了一种心理听觉动机的专注耳概念模型,并介绍了一种ASR系统作为我们提出的模型的应用。尽管十年来提出了基于人类听觉过程的ASR系统的心理听觉概念模型,但这些模型并不像使用传统降噪方法和基于统计方法的自适应方法的ASR系统那样健壮。这是因为基于人的听觉过程的概念模型被适当地用作预处理器,并与传统的ASR系统中的统计识别器连接,尽管理论上与心理听觉驱动的概念模型和基于统计方法的模型不同。 。基于我们的模型的ASR系统不是ASR系统的预处理器,而是ASR系统本身。为了评估基于我们模型的ASR系统,我们在六个典型的嘈杂环境中进行了日语数字识别实验。结果表明,在0 dB SNR的实验条件下,我们的ASR系统比传统系统更强大。这些结果表明,基于人类听觉感知过程,专心的耳朵的基于心理听觉的概念模型对于ASR是有效的,并且在不利的嘈杂环境中也很健壮。

著录项

  • 来源
  • 会议地点 Beijing(CN);Beijing(CN)
  • 作者单位

    School of Information Science,Japan Advanced Institute of Science and Technology1–1 Asahidai,Nomi,Ishikawa,923–1292 Japan;

    School of Information Science,Japan Advanced Institute of Science and Technology1–1 Asahidai,Nomi,Ishikawa,923–1292 Japan;

    School of Information Science,Japan Advanced Institute of Science and Technology1–1 Asahidai,Nomi,Ishikawa,923–1292 Japan;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 声学;声学;
  • 关键词

  • 入库时间 2022-08-26 14:23:07

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号