首页> 中文学位 >Hidden Markov Model Based Automatic Speech Recognition Using Mel Frequency Cepstral Coefficients In Nepalese
【6h】

Hidden Markov Model Based Automatic Speech Recognition Using Mel Frequency Cepstral Coefficients In Nepalese

代理获取

目录

Hunau University Statement of Originality and Copyright Statement

List of Tables、List of Figures

英文文摘

Acknowledgements

Chapter 1 Introduction to ASR

Chapter 2 Understanding the features of Nepali Language

Chapter 3 Classification of Nepalese phonemes

Chapter 4 Hidden Markov Model

Chapter 5 Understanding Hidden Markov Model Tool Kit (HTK)

Chapter 6 Implementation Details

References

展开▼

摘要

Nepalese (also called Nepali) is a language of some importance in the northern part of South Asia and is spoken mainly in Nepal, Bhutan and India. The impetus behind this undertaking to implement automatic speech recognition in Nepalese has been the fact that little research has been done in this area compared to the plethora of materials available for other languages like English. Hidden Markov models will be used with MFCC (Mel Frequency Cepstral Coefficients) analysis in the project. HMM,though applicable in many other pattern recognizers as well, has gained a prominent niche in ASR. The system, designed using HTK [1 HTKBook], starts with a preprocessing stage, which converts a speech waveform into feature vectors. The second stage is training the recognizer. Lastly, it will be used to decode new speech data. The building-block components of the system are phoneme-level statistical models. Word-level acoustic models will be formed by concatenating phone-level models according to a pronunciation dictionary. These word models will then be combined with a language model, which constrains the utterances to valid word sequences.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号