...
首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Formant estimation for speech recognition
【24h】

Formant estimation for speech recognition

机译:语音识别的共振峰估计

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator. Each resonator represents a segment of the short-time power spectrum. The complete spectrum is modeled by a set of digital resonators connected in parallel. An algorithm based on dynamic programming produces both the model parameters and the segment boundaries that optimally match the spectrum. We used this method in experimental tests that were carried out on the TI digit string data base. The main results of the experimental tests are: (1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and (2) the estimated formant frequencies were used in a number of variants for recognition. The best set-up resulted in a string error rate of 4.2% on the adult corpus of the TI digit string data base
机译:本文提出了一种估计共振峰频率的新方法。共振峰模型基于数字谐振器。每个谐振器代表短时功率谱的一部分。完整的频谱由一组并联的数字谐振器建模。基于动态编程的算法会同时产生模型参数和与光谱最佳匹配的段边界。我们在TI数字字符串数据库上进行的实验测试中使用了此方法。实验测试的主要结果是:(1)提出的方法可以在各种声音和扬声器中产生可靠的共振峰频率估计; (2)将估计的共振峰频率用于多种变体中以进行识别。最佳设置导致TI数字字符串数据库的成年语料库的字符串错误率达到4.2%

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号