首页> 外文期刊>IEE Proceedings. Part K, Vision, image and signal processing >Multitapering and a wavelet variant of MFCC in speech recognition
【24h】

Multitapering and a wavelet variant of MFCC in speech recognition

机译:语音识别中MFCC的多锥度和小波变体

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In speech recognition (ASR) based on hidden Markov models (HMM) it is necessary to obtain a spectral approximation with a reduced set of representation coefficients. The author introduces to the speech parameterisation scheme multitapering and a modification of the usual mel frequency cepstrum coefficients (MFCC) processing scheme based on wavelets on intervals (wavelet frequency coefficients, WFC). Phoneme recognition performance improvements compared to the MFCC have been experimentally verified on data from a speech database, using multitapering and WFC.
机译:在基于隐马尔可夫模型(HMM)的语音识别(ASR)中,有必要获得具有减少的表示系数集的频谱近似。作者介绍了语音参数化方案的多锥度和基于间隔小波(小波频率系数,WFC)的通常的梅尔频率倒谱系数(MFCC)处理方案的修改。通过使用多锥度和WFC,已对语音数据库中的数据进行了实验验证,与MFCC相比,音素识别性能得到了改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号