A model of auditory perception as front end for automatic speech recognition.

Tchorz J; Kollmeier B

首页> 外文期刊>The Journal of the Acoustical Society of America >A model of auditory perception as front end for automatic speech recognition.

【24h】

A model of auditory perception as front end for automatic speech recognition.

机译：听觉感知模型作为自动语音识别的前端。

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A front end for automatic speech recognizers is proposed and evaluated which is based on a quantitative model of the "effective" peripheral auditory processing. The model simulates both spectral and temporal properties of sound processing in the auditory system which were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated word recognition experiments in different types of additive noise. The results show a higher robustness of the auditory front end in noise, compared to common mel-scale cepstral feature extraction. In a second set of experiments, different processing stages of the auditory front end were modified to study their contribution to robust speech signal representation in detail. The adaptive compression stage which enhances temporal changes of the input signal appeared to be the most important processing stage towards robust speech representation in noise. Low-pass filtering of the fast fluctuating envelope in each frequency band further reduces the influence of noise in the auditory-based representation of speech.

机译：提出并评估了自动语音识别器的前端，该前端基于“有效”外围听觉处理的定量模型。该模型模拟了在心理声学和生理学实验中发现的听觉系统中声音处理的频谱和时间特性。在不同类型的加性噪声中，在独立于说话者的独立单词识别实验中，评估了基于听觉的语音表示的鲁棒性。结果表明，与普通的梅尔尺度倒谱特征提取相比，听觉前端在噪声中具有更高的鲁棒性。在第二组实验中，对听觉前端的不同处理阶段进行了修改，以详细研究其对鲁棒语音信号表示的贡献。增强输入信号的时间变化的自适应压缩级似乎是在噪声中实现鲁棒语音表示的最重要的处理阶段。在每个频带中快速波动的包络线的低通滤波进一步降低了噪声在基于听觉的语音表示中的影响。

著录项

来源
《The Journal of the Acoustical Society of America》 |1999年第1期|共11页
作者
Tchorz J; Kollmeier B;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类声学;
关键词
Auditory Perception: Physiology; Automatism; Human; Models; Biological; Models; Psychological; Noise; Speech Perception: Physiology; Time Factors;

机译：听觉感知：生理;自动;人类;模型;生物学;模型;心理;噪声;语音感知：生理;时间因素;

相似文献

外文文献
中文文献
专利

1. A model of auditory perception as front end for automatic speech recognition. [J] . Tchorz J, Kollmeier B The Journal of the Acoustical Society of America . 1999,第4aPta1期

机译：听觉感知模型作为自动语音识别的前端。
2. Speech Encoding in the Human Auditory Periphery: Modeling and Quantitative Assessment by Means of Automatic Speech Recognition [J] . Holmberg Marcus Fortschritt-Berichte VDI, Reihe 8. Mess-, Steuerungs- und Regelungstechnik . 2009,第1162期

机译：人类听觉外围的语音编码：借助自动语音识别的建模和定量评估
3. Comparing auditory perception and speech production outcomes: non-language specific assessment of auditory perception and speech production in children with cochlear implants. [J] . Phillips L, Hassanzadeh S, Kosaner J, Cochlear implants international . 2009,第2期

机译：比较听觉和言语产生的结果：对非人工耳蜗植入儿童的听觉和言语产生进行非语言特定评估。
4. Auditory front-ends for noise-robust automatic speech recognition [C] . 2010 7th International Symposium on Chinese Spoken Language Processing . 2010

机译：听觉前端，用于抗噪自动语音识别
5. Modeling auditory perception for robust speech recognition. [D] . Strope, Brian P. 1998

机译：建模听觉感知以增强语音识别能力。
6. Auditory cortical deactivation during speech production and following speech perception: an EEG investigation of the temporal dynamics of the auditory alpha rhythm [O] . David Jenson, Ashley W. Harkrider, David Thornton, 2015

机译：语音产生和语音感知后听觉皮层失活：听觉α节奏的时间动态的脑电图调查。
7. A Markov Random Field Model for Automatic Speech Recognition. [O] . Guillaume Gravier, Marc Sigelle 2008

机译：一种用于自动语音识别的马尔可夫随机场模型。

A model of auditory perception as front end for automatic speech recognition.

摘要

著录项

相似文献

相关主题

期刊订阅