6th International Conference on Spoken Language Processing (ICSLP 2000), Oct. 16-20, 2000, Beijing International Convention Center, Beijing, China

An investigation of variable block length methods for calculation of spectral/temporal features for automatic speech recognition


Abstract

This paper presents an investigation of non-uniform time sampling methods for spectral/temporal feature extraction for use in automatic speech recognition. In most current methods for signal modeling of speech information, "dynamic" features are determined from frame-based parameters using a fixed time sampling, i.e. fixed block length and fixed block spacing. This work explores new methods in which block length and/or block spacing are variable. Three methods are suggested, and each was tested with the TIMIT database using a standard HMM recognizer. Phone recognition experiments were conducted using the standard 39-phone set. The methods were also evaluated with various HMM model complexities. Experimental results indicated that none of the proposed non-uniform feature time sampling methods performs significantly better than fixed time sampling methods. However, the best results obtained with the front end are comparable to those obtained with current state-of-the-art systems. Also, the performance of our monophone system surpasses that of most reported context-dependent monophone systems.
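
The difference between fixed and variable time sampling can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not describe the three proposed methods, so the cosine expansion over each block, the function names `cosine_basis` and `block_features`, and all block lengths and spacings below are assumptions chosen only for illustration.

```python
import numpy as np

def cosine_basis(block_len, n_terms):
    """Return an (n_terms, block_len) matrix of cosine basis functions."""
    t = (np.arange(block_len) + 0.5) / block_len
    k = np.arange(n_terms)[:, None]
    return np.cos(np.pi * k * t[None, :])

def block_features(frames, centers, lengths, n_terms=3):
    """Summarize each block of frames by a few cosine-expansion coefficients.

    frames:  (n_frames, n_params) array of frame-based parameters
    centers: frame index at which each block is centered (block spacing)
    lengths: block length in frames for each block; may differ per block
    """
    n_frames = frames.shape[0]
    out = []
    for c, length in zip(centers, lengths):
        start = c - length // 2
        # Repeat the edge frames when a block runs past the signal boundary.
        idx = np.clip(np.arange(start, start + length), 0, n_frames - 1)
        block = frames[idx]                       # (length, n_params)
        basis = cosine_basis(length, n_terms)     # (n_terms, length)
        coeffs = basis @ block / length           # (n_terms, n_params)
        out.append(coeffs.ravel())
    return np.array(out)

# Hypothetical input: 100 frames of 13 frame-based parameters.
rng = np.random.default_rng(0)
frames = rng.standard_normal((100, 13))
centers = np.arange(0, 100, 5)

# Fixed time sampling: every block spans 10 frames, spaced 5 frames apart.
fixed = block_features(frames, centers, [10] * len(centers))

# Variable block length (arbitrary rule for illustration only):
# shorter blocks in the first half of the signal, longer ones afterwards.
var_lengths = np.where(centers < 50, 6, 14)
variable = block_features(frames, centers, var_lengths)
```

Both calls yield one feature vector per block with the same dimensionality, which is what allows a fixed-size HMM observation vector even when the block length changes from block to block.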
