首页> 外文期刊>American journal of applied sciences >A HYBRID SPEECH RECOGNITION SYSTEM WITH HIDDEN MARKOV MODEL AND RADIAL BASIS FUNCTION NEURAL NETWORK | Science Publications
【24h】

A HYBRID SPEECH RECOGNITION SYSTEM WITH HIDDEN MARKOV MODEL AND RADIAL BASIS FUNCTION NEURAL NETWORK | Science Publications

机译:具有隐马尔可夫模型的混合语音识别系统和径向基功能神经网络|科学出版物

获取原文
           

摘要

> We analyze the performance of continuous speech recognition of a speaker independent system using Hidden Markov Model and Artificial Neural Network. Modern speech recognition systems use different combinations of the standard techniques over the basic approach to improve performance accuracy. One such combination which has gained more attention is the hybrid model. Our hybrid system for continuous speech recognition consists of a combination of Hidden Markov Model in the front end and the Neural Network with Radial basis function as the back end. The speech recognition process consists of the training phase and the recognition phase. The speech sentences are pre-processed and the features are extracted. The extracted feature vector is clustered into a model database by Hidden Markov Model and is trained by the Radial Basis Function Neural Network. During the recognition phase, the continuous sentence is pre-processed and its feature vector is modelled. This is compared with the database model which contains models stored during the training process. When a match occurs, the model is recognized and the recognition is made for the least error. From the recognized output the word error rate is computed, which is a measure of recognition performance of the hybrid model. The audio files of continuous sentences are taken from the TIMIT database. The performance of our hybrid HMM/RBFNN gives 65% recognition rate.
机译: >我们使用隐马尔可夫模型和人工神经网络分析扬声器独立系统的连续语音识别性能。现代语音识别系统使用标准技术的不同组合来通过基本方法来提高性能准确性。一种这样的组合,其具有更多关注的是混合模型。我们的连续语音识别的混合系统包括前端和神经网络中的隐马尔可夫模型的组合,具有径向基函数作为后端。语音识别过程包括训练阶段和识别阶段。语音句是预处理的,提取功能。提取的特征向量通过隐藏的马尔可夫模型群集到模型数据库中,并由径向基函数神经网络训练。在识别阶段期间,预处理连续句子并进行建模的特征向量。将其与包含在培训过程中存储的模型的数据库模型进行比较。当发生匹配时,识别模型并为最小错误进行识别。从识别的输出,计算字错误率,这是混合模型的识别性能的量度。连续句子的音频文件取自Timit数据库。我们的杂交型HMM / RBFNN的表现提供了65%的识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号