Acoustical Pre-Processing for Robust Speech Recognition.

机译：用于鲁棒语音识别的声学预处理。

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper we describe our initial efforts to make SPHINX, the CMU continuous speech recognition system, environmentally robust. Our work has two major goals: to enable SPHINX to adapt to changes in microphone and acoustical environment, and to improve the performance of SPHINX when it is trained and tested using a desk-top microphone. This talk will describe some of our work in acoustical pre-processing techniques, specifically spectral normalization and spectral subtraction performed using an efficient pair of algorithms that operate primarily in the cepstral domain. The effects of these signal processing algorithms on the recognition accuracy of the Sphinx speech recognition system was compared using speech simultaneously recorded from two types of microphones: the standard close-talking Sennheiser HMD224 microphone and the desk-top Crown PZM6fs microphone. A naturally- elicited alphanumeric speech database was used. In initial results using the stereo alphanumeric database, we found that both the spectral subtraction and spectral normalization algorithms were able to provide very substantial improvements in recognition accuracy when the system was trained on the close-talking microphone and tested on the desk-top microphone, or vice versa. Improving the recognition accuracy of the system when trained and tested on the desk-top microphone remains a difficult problem requiring more sophisticated noise suppression techniques.

著录项

作者
Stern, R. M.; Acero, A.;
展开▼
作者单位

展开▼
年度 1989
页码 1-9
总页数 9
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Signal processing; Speech recognition; Acoustics; Data bases; Environments; Speech; Normalizing(Statistics); Alphanumeric data; Microphones; Noise reduction; Efficiency; Spectra; Accuracy; Algorithms; Environmentally robust; Sphinx;

机译：信号处理;语音识别;声学;数据库;环境;语音;归一化（统计）;字母数字数据;麦克风;降噪;效率;光谱;准确度;算法;环境稳健;狮身人面像;

相似文献

外文文献
中文文献
专利

1. On the efficiency of classical RASTA filtering for continuous speech recognition: Keeping the balance between acoustic pre-processing and acoustic modelling [J] . Johan de Veth, Louis Boves Speech Communication . 2003,第3a4期

机译：关于用于连续语音识别的经典RASTA过滤的效率：保持声学预处理与声学建模之间的平衡
2. Context-adaptive pre-processing scheme for robust speech recognition in fast-varying noise environment [J] . Iosif Mporas, Todor Ganchev, Otilia Kocsis, Signal processing . 2011,第8期

机译：时变噪声环境下用于语音识别的上下文自适应预处理方案
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. Speech Input Pre-Processing for Car Driver Robust Automatic Speech Recognition [C] . Sacha Vrazic, Ippei Sugae, Hisashi Inaba, World congress on intelligent transport systems . 2013

机译：驾驶员可靠的语音自动识别语音输入预处理
5. Synergy of acoustic-phonetics and auditory modeling towards robust speech recognition. [D] . Deshmukh, Om D. 2006

机译：语音和听觉建模对强大语音识别的协同作用。
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. Acoustical Pre-processing for Robust Speech Recognition [O] . Richard M. Stern, Ro Acero 1989

机译：用于鲁棒语音识别的声学预处理

Acoustical Pre-Processing for Robust Speech Recognition.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅