A psychoacoustically-motivated conceptual model for automatic speech recognition



Abstract

This paper describes a conceptual model of human auditory perception processes for automatic speech recognition (ASR) systems. Even in complex noisy environments, humans can perceptually segregate the target sound they attend to from an acoustic mixture. This paper names this primary advantage of the human auditory perceptual process the "attentive ear." We propose a psychoacoustically-motivated conceptual model of the attentive ear and introduce an ASR system as an application of the proposed model. Although psychoacoustically-motivated conceptual models based on human auditory processes have been proposed for ASR systems over the past decade, they are not as robust as ASR systems that use traditional noise reduction methods and adaptation methods based on statistical methodology. This is because these conceptual models were used as a preprocessor connected to a statistical recognizer in traditional ASR systems, even though a psychoacoustically-motivated conceptual model and a model based on statistical methodology are theoretically different. The ASR system based on our model is not a preprocessor for an ASR system but an ASR system itself. To evaluate it, we carried out Japanese digit recognition experiments in six typical noisy environments. Results showed that our ASR system is more robust than traditional ones under experimental conditions of 0 dB SNR. These results suggest that our psychoacoustically-motivated conceptual model based on the human auditory perceptual process, the attentive ear, is effective for ASR and robust in adverse noisy environments.


Bibliographic details

Source
10th Western Pacific Acoustics Conference | 2009 | pp. 1-8 | 8 pages
Conference location: Beijing (CN)
Authors
Atsushi Haniu; Masashi Unoki; Masato Akagi
Author affiliations

School of Information Science, Japan Advanced Institute of Science and Technology, 1–1 Asahidai, Nomi, Ishikawa, 923–1292 Japan (all three authors)

Original format: PDF
Language of text: English
Chinese Library Classification: Acoustics
Date indexed: 2022-08-26 14:23:07

Similar literature

1. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J]. Odette Scharenborg, Louis ten Bosch, Lou Boves. The Journal of the Acoustical Society of America, 2003, No. 6

2. Speech Encoding in the Human Auditory Periphery: Modeling and Quantitative Assessment by Means of Automatic Speech Recognition [J]. Holmberg Marcus. Fortschritt-Berichte VDI, Reihe 8. Mess-, Steuerungs- und Regelungstechnik, 2009, No. 1162

3. Critique: The potential role of speech production models in automatic speech recognition [J]. Roger K. Moore. The Journal of the Acoustical Society of America, 1996, No. 3

4. A psychoacoustically-motivated conceptual model for automatic speech recognition [C]. Atsushi Haniu, Masashi Unoki, Masato Akagi. Western Pacific Acoustics Conference, 2009

5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques [D]. Singh, Amriteshwar. 2011

6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O]. Byeongwook Lee, Kwang-Hyun Cho

7. Modelling Human Speech Recognition using Automatic Speech Recognition Paradigms in SpeM [O]. Scharenborg O.E., McQueen J.M., Bosch L.F.M. ten. 2003
