Formant estimation for speech recognition

Welling L.; Ney H.


Abstract

This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator. Each resonator represents a segment of the short-time power spectrum, and the complete spectrum is modeled by a set of digital resonators connected in parallel. An algorithm based on dynamic programming produces both the model parameters and the segment boundaries that optimally match the spectrum. The method was evaluated in experimental tests on the TI digit string database. The main results are: (1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and (2) the estimated formant frequencies were used in a number of variants for recognition. The best set-up resulted in a string error rate of 4.2% on the adult corpus of the TI digit string database.
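
The abstract names the ingredients (one resonator per spectral segment, boundaries chosen by dynamic programming) but not the exact cost function, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each segment is fit with a second-order linear predictor computed from the segment's own autocorrelation terms, that the prediction error serves as the segment cost, and that the formant estimate is the pole angle of the fitted predictor. All function names (`fit_resonator`, `pole_frequency`, `estimate_formants`) and the synthetic three-peak spectrum are hypothetical.

```python
import numpy as np

def fit_resonator(r0, r1, r2):
    """Second-order predictor from autocorrelation terms; returns (error, a1, a2).
    Assumed per-segment model, not taken from the paper."""
    det = r0 * r0 - r1 * r1
    if r0 <= 0.0 or det <= 1e-12 * r0 * r0:
        return 0.0, 0.0, 0.0  # degenerate segment: treated as zero error
    a1 = r1 * (r0 - r2) / det
    a2 = (r0 * r2 - r1 * r1) / det
    return max(r0 - a1 * r1 - a2 * r2, 0.0), a1, a2

def pole_frequency(a1, a2):
    """Pole angle (rad) of 1 - a1 z^-1 - a2 z^-2, or NaN if the poles are real."""
    if a1 * a1 + 4.0 * a2 >= 0.0:
        return float("nan")
    return float(np.arccos(a1 / (2.0 * np.sqrt(-a2))))

def estimate_formants(power, omega, n_formants, min_bins=2):
    """Split the spectrum into n_formants contiguous segments that minimize the
    summed prediction error; return (boundaries, resonance angles in rad)."""
    n = len(power)
    # weighted[nu, m] = S(w_m) * cos(nu * w_m) for nu = 0, 1, 2
    weighted = power * np.cos(np.arange(3)[:, None] * omega)
    # cost[i, j]: fit error of one resonator on bins i .. j-1
    cost = np.full((n + 1, n + 1), np.inf)
    for i in range(n):
        for j in range(i + min_bins, n + 1):
            cost[i, j] = fit_resonator(*weighted[:, i:j].sum(axis=1))[0]
    # best[k, j]: minimal total error covering bins 0 .. j-1 with k segments
    best = np.full((n_formants + 1, n + 1), np.inf)
    back = np.zeros((n_formants + 1, n + 1), dtype=int)
    best[0, 0] = 0.0
    for k in range(1, n_formants + 1):
        for j in range(1, n + 1):
            totals = best[k - 1, :j] + cost[:j, j]
            i = int(np.argmin(totals))
            best[k, j], back[k, j] = totals[i], i
    # Backtrack the optimal boundaries from the last bin to the first
    bounds = [n]
    for k in range(n_formants, 0, -1):
        bounds.append(int(back[k, bounds[-1]]))
    bounds.reverse()
    angles = []
    for i, j in zip(bounds[:-1], bounds[1:]):
        _, a1, a2 = fit_resonator(*weighted[:, i:j].sum(axis=1))
        angles.append(pole_frequency(a1, a2))
    return bounds, angles

# Synthetic power spectrum with three narrow peaks (illustrative data only)
fs = 8000.0
freqs_hz = np.linspace(0.0, fs / 2, 129)
omega = 2 * np.pi * freqs_hz / fs
true_peaks = [500.0, 1500.0, 2500.0]
power = sum(np.exp(-0.5 * ((freqs_hz - f) / 50.0) ** 2) for f in true_peaks)

bounds, angles = estimate_formants(power, omega, n_formants=3)
formants_hz = [a * fs / (2 * np.pi) for a in angles]
print("segment boundaries (bin indices):", bounds)
print("estimated resonance frequencies (Hz):", np.round(formants_hz, 1))
```

In this sketch the segment boundaries and the resonator parameters come out of the same minimization, which mirrors the joint estimation the abstract describes. The cost table is filled by brute force for readability; a practical version would likely use cumulative sums and apply the search to every analysis frame.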


Bibliographic details

Source
IEEE Transactions on Speech and Audio Processing | 1998, No. 1 | pp. 36-48 | 13 pages
Authors
Welling L.; Ney H.
Original format: PDF
Text language: English
CLC classification: Electroacoustic technology and speech signal processing
Keywords
dynamic programming; frequency estimation; resonators; spectral analysis; speech processing; speech recognition; TI digit string data base; adult corpus; algorithm; digital resonators; dynamic programming; experimental tests; formant frequencies estimation; formant m;


