Perceptual speech processing and phonetic feature mapping for robust vowel recognition

Linkai Bu; Church T.-D.

Abstract

We propose perceptual speech processing and phonetic feature mapping, both inspired by the perceptual characteristics of human hearing. The proposed perceptual speech processing is based on three perceptual characteristics and consists of three independent processing steps: masking effect, minimum audible field renormalization, and mel-scale resampling. These steps, respectively, remove imperceptible spectral components, adjust the magnitude scale of the speech spectra, and adjust their frequency scale. We apply the three steps to the speech spectrum sequentially to generate a new speech signal representation called the perceptual spectrum. For Mandarin vowel recognition, nine representative vowels are selected as references, and similarity measures to these reference spectra, called phonetic features, are then generated from the perceptual spectrum. These phonetic features serve as speech parameters in a continuous HMM-based recognition stage. With these two techniques, a high recognition accuracy on Mandarin vowel phonemes has been achieved. Further experiments confirm that a significant improvement in recognition robustness with respect to speaker variation and noise contamination can also be obtained.
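
The last of the three processing steps and the phonetic feature mapping can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which mel formula, interpolation scheme, or similarity measure the paper uses, so the common 2595·log10(1 + f/700) mel formula, linear interpolation, and cosine similarity are assumptions made here for illustration. The function names (`mel_resample`, `phonetic_features`) are hypothetical, and the masking and minimum audible field steps are omitted because they depend on curves the abstract does not give.

```python
import math


def hz_to_mel(f_hz):
    # Assumed mel formula; the abstract does not state which one is used.
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_resample(spectrum, sample_rate, n_out):
    """Resample a magnitude spectrum (bins uniformly spaced from 0 Hz to
    Nyquist) at n_out points uniformly spaced on the mel scale, using
    linear interpolation. Requires len(spectrum) >= 2 and n_out >= 2."""
    n_in = len(spectrum)
    nyquist = sample_rate / 2.0
    bin_hz = nyquist / (n_in - 1)
    max_mel = hz_to_mel(nyquist)
    out = []
    for k in range(n_out):
        f = mel_to_hz(max_mel * k / (n_out - 1))
        pos = min(f / bin_hz, n_in - 1)   # fractional bin index, clamped
        lo = min(int(pos), n_in - 2)      # left neighbour bin
        frac = pos - lo
        out.append((1.0 - frac) * spectrum[lo] + frac * spectrum[lo + 1])
    return out


def cosine_similarity(a, b):
    # Assumed similarity measure; returns 0.0 if either vector is all zeros.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def phonetic_features(perceptual_spectrum, reference_spectra):
    """One similarity score per reference vowel spectrum."""
    return [cosine_similarity(perceptual_spectrum, ref)
            for ref in reference_spectra]
```

With nine reference spectra, as in the abstract, `phonetic_features` would return a nine-element vector per frame, which is the kind of parameter vector an HMM stage could consume. Sampling uniformly in mel allocates more of the output points to low frequencies than to high ones.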

Bibliographic details

Source
IEEE Transactions on Speech and Audio Processing | 2000, No. 2 | pp. 105-114 | 10 pages
Authors
Linkai Bu; Church T.-D.
Original format: PDF
Text language: English
Chinese Library Classification: Electroacoustic technology and speech signal processing
Keywords
feature extraction; hearing; hidden Markov models; natural languages; noise; signal representation; signal sampling; spectral analysis; speech processing; speech recognition; Mandarin vowel phonemes; Mandarin vowel recognition; continuous HMM-based recognition; exper;
