首页> 外文会议>IEEE Region 10 Annual Conference >Effect of different sampling rates and feature vector sizes on speech recognition performance
【24h】

Effect of different sampling rates and feature vector sizes on speech recognition performance

机译:不同采样率的影响和特征向量大小对语音识别性能的影响

获取原文

摘要

We conduct a systematic study to evaluate the effect of the sampling rate and feature vector size on the performance of a hidden Markov model (HMM) based speech recognizer. We investigate the use of the following two types of features: linear prediction (LP) derived cepstral coefficients (LPCC) and Mel frequency cepstral coefficients (MFCC). We demonstrate that for the LPCC front-end, the optimum sampling rate and feature vector size are 12 kHz and 14, respectively. We also show that for different sampling rates, the accuracy peaks at different sizes of the feature vector. For the MFCC front-end, the optimum feature vector size and sampling rate are 14 and 14 kHz, respectively.
机译:我们进行系统研究,以评估采样率和特征向量大小对基于隐马尔可夫模型(HMM)的语音识别器的性能的影响。我们研究了以下两种特征的使用:线性预测(LP)衍生的倒谱系数(LPCC)和MEL频率谱系数(MFCC)。我们证明,对于LPCC前端,最佳采样率和特征向量尺寸分别为12 kHz和14。我们还表明,对于不同的采样率,特征向量的不同尺寸的精度峰。对于MFCC前端,最佳特征向量尺寸和采样率分别为14和14 kHz。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号