首页> 外文会议>2011 4th International Congress on Image and Signal Processing >Performances comparison between Improved DHMM and Gaussian Mixture HMM for speech recognition
【24h】

Performances comparison between Improved DHMM and Gaussian Mixture HMM for speech recognition

机译:改进的DHMM与高斯混合HMM在语音识别方面的性能比较

获取原文

摘要

This paper compares the performances, recognition rate and computation speed, between an Improved Discrete Hidden Markov Model (DHMM) and Gaussian Mixture Hidden Markov Model (GMHMM) for Mandarin speech recognition. The fuzzy vector quantization (FVQ) is used to improve the modeling of DHMM for the speech recognition. A codebook for DHMM will be first trained by K-means algorithms using Mandarin training speech feature. Then, based on the trained codebook, the speech features are quantized by the fuzzy sets and then are statistically applied to train the model of DHMM. Experimental results in this paper will show that the speech recognition rate can be improved by using FVQ algorithm to train the model of DHMM. The recognition rate by using an improved DHMM is only a little bit less than that by using GMHMM. However, the computation time for speech recognition by using improved DHMM is much less than that by using GMHMM. These results reveal that the improved DHMM is more suitable to real-time applications than GMHMM.
机译:本文比较了改进的离散隐马尔可夫模型(DHMM)和高斯混合隐马尔可夫模型(GMHMM)在普通话语音识别中的性能,识别率和计算速度。模糊矢量量化(FVQ)用于改进DHMM的语音识别模型。 DHMM的密码本将首先通过K-means算法使用普通话培训语音功能进行培训。然后,基于训练后的码本,通过模糊集对语音特征进行量化,然后将其统计应用以训练DHMM模型。实验结果表明,采用FVQ算法训练DHMM模型可以提高语音识别率。使用改进的DHMM的识别率仅比使用GMHMM的识别率低。但是,使用改进的DHMM进行语音识别的计算时间比使用GMHMM进行计算的时间要少得多。这些结果表明,改进的DHMM比GMHMM更适合于实时应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号