首页> 外文会议>IASTED International Conference on Signal and Image Processing >TRIPHONE BASED CONTINUOUS SPEECH RECOGNITION SYSTEM FOR TURKISH LANGUAGE USING HIDDEN MARKOV MODEL
【24h】

TRIPHONE BASED CONTINUOUS SPEECH RECOGNITION SYSTEM FOR TURKISH LANGUAGE USING HIDDEN MARKOV MODEL

机译:采用隐马尔可夫模型的土耳其语言的三磡连续语音识别系统

获取原文

摘要

This paper introduces a system which is designed to perform a relatively accurate transcription of speech and in particular, continuous speech recognition based on triphone model for Turkish language. Turkish is generally different from Indo-European languages (English, Spanish, French, German etc.) by its agglutinative and suffixing morphology. Therefore vocabulary growth rate is very high and as a consequence, constructing a continuous speech recognition system for Turkish based on whole words is not feasible. By considering this fact in this paper, acoustic models which are based on triphones, are modelled as five state Hidden Markov Models (HMM). Mel-Frequency Cepstral Coefficients (MFCC) approach was preferred as the feature vector extraction method and training is done using embedding training that uses Baum-Welch re-estimation. Recognition is implemented on a search network which can be ultimately seen as HMM states connected by transitions and Viterbi Token Passing algorithm runs on this network to find the mostly likely state sequence according to the utterance. Also to make a more accurate recognition bigram language model is constructed.
机译:本文介绍了一种系统,该系统旨在基于土耳其语言的Triphone模型来执行相对准确的语音转录,特别是连续语音识别。土耳其语通常与印度欧洲语言(英语,西班牙语,法语,德语等)不同,通过其凝集和后缀形态。因此,词汇增长率非常高,因此,基于整个词的土耳其语构建连续语音识别系统是不可行的。通过考虑本文的这一事实,基于Triphones的声学模型被建模为五个状态隐马尔可夫模型(HMM)。熔融频率谱系数(MFCC)方法是优选的,因为使用嵌入训练使用使用Baum-Welch重新估计的嵌入训练来完成。识别在搜索网络上实现,该搜索网络可以最终被视为通过转换和维特比令牌传递算法在该网络上运行的HMM状态,以找到根据话语的最可能的状态序列。还要制定更准确的识别,构建了Bigram语言模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号