首页> 外文会议> >Effect of different sampling rates and feature vector sizes on speech recognition performance
【24h】

Effect of different sampling rates and feature vector sizes on speech recognition performance

机译:不同采样率和特征向量大小对语音识别性能的影响

获取原文
获取外文期刊封面目录资料

摘要

We conduct a systematic study to evaluate the effect of the sampling rate and feature vector size on the performance of a hidden Markov model (HMM) based speech recognizer. We investigate the use of the following two types of features: linear prediction (LP) derived cepstral coefficients (LPCC) and Mel frequency cepstral coefficients (MFCC). We demonstrate that for the LPCC front-end, the optimum sampling rate and feature vector size are 12 kHz and 14, respectively. We also show that for different sampling rates, the accuracy peaks at different sizes of the feature vector. For the MFCC front-end, the optimum feature vector size and sampling rate are 14 and 14 kHz, respectively.
机译:我们进行了系统的研究,以评估采样率和特征向量大小对基于隐马尔可夫模型(HMM)的语音识别器性能的影响。我们调查了以下两种类型的功能的使用:线性预测(LP)得出的倒谱系数(LPCC)和梅尔频率倒谱系数(MFCC)。我们证明,对于LPCC前端,最佳采样率和特征向量大小分别为12 kHz和14。我们还表明,对于不同的采样率,精度在特征向量的不同大小处达到峰值。对于MFCC前端,最佳特征向量大小和采样率分别为14和14 kHz。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号