TRIPHONE BASED CONTINUOUS SPEECH RECOGNITION SYSTEM FOR TURKISH LANGUAGE USING HIDDEN MARKOV MODEL

机译：采用隐马尔可夫模型的土耳其语言的三磡连续语音识别系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a system which is designed to perform a relatively accurate transcription of speech and in particular, continuous speech recognition based on triphone model for Turkish language. Turkish is generally different from Indo-European languages (English, Spanish, French, German etc.) by its agglutinative and suffixing morphology. Therefore vocabulary growth rate is very high and as a consequence, constructing a continuous speech recognition system for Turkish based on whole words is not feasible. By considering this fact in this paper, acoustic models which are based on triphones, are modelled as five state Hidden Markov Models (HMM). Mel-Frequency Cepstral Coefficients (MFCC) approach was preferred as the feature vector extraction method and training is done using embedding training that uses Baum-Welch re-estimation. Recognition is implemented on a search network which can be ultimately seen as HMM states connected by transitions and Viterbi Token Passing algorithm runs on this network to find the mostly likely state sequence according to the utterance. Also to make a more accurate recognition bigram language model is constructed.

机译：本文介绍了一种系统，该系统旨在基于土耳其语言的Triphone模型来执行相对准确的语音转录，特别是连续语音识别。土耳其语通常与印度欧洲语言（英语，西班牙语，法语，德语等）不同，通过其凝集和后缀形态。因此，词汇增长率非常高，因此，基于整个词的土耳其语构建连续语音识别系统是不可行的。通过考虑本文的这一事实，基于Triphones的声学模型被建模为五个状态隐马尔可夫模型（HMM）。熔融频率谱系数（MFCC）方法是优选的，因为使用嵌入训练使用使用Baum-Welch重新估计的嵌入训练来完成。识别在搜索网络上实现，该搜索网络可以最终被视为通过转换和维特比令牌传递算法在该网络上运行的HMM状态，以找到根据话语的最可能的状态序列。还要制定更准确的识别，构建了Bigram语言模型。

著录项

来源
《IASTED International Conference on Signal and Image Processing》|2010年||共5页
会议地点
作者
Fatma Patlar; Akhan Akbulut;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN911.7-53;
关键词
Continuous Speech Recognition; Triphone; Hidden Markov Model; Language Modelling; Bigram language model; Turkish;

机译：连续语音识别;三磡;隐藏马尔可夫型号;语言建模;BIGRAM语言模型;土耳其人;
入库时间 2022-08-20 19:51:13

相似文献

外文文献
中文文献
专利

1. Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems [J] . Febe de Wet, Neil Kleynhans, Dirk van Compernolle, South African Journal of Science . 2017,第1a2期

机译：资源不足语言的语音识别：隐马尔可夫模型系统中的数据共享
2. Word and Triphone Based Approaches in Continuous Speech Recognition for Tamil Language [J] . R. THANGARAJAN, A. M. NATARAJAN, M. SELVAM WSEAS Transactions on Signal Processing . 2008,第3期

机译：泰米尔语语言中基于单词和三音素的连续语音识别方法
3. A Configurable Logic Based Architecture for Real-Time Continuous Speech Recognition Using Hidden Markov Models [J] . PANAGIOTIS STOGIANNOS, APOSTOLOS DOLLAS, VASSILIS DIGALAKIS Journal of VLSI signal processing . 2000,第2a3期

机译：基于隐马尔可夫模型的基于可配置逻辑的实时连续语音识别架构
4. TRIPHONE BASED CONTINUOUS SPEECH RECOGNITION SYSTEM FOR TURKISH LANGUAGE USING HIDDEN MARKOV MODEL [C] . Fatma Patlar, Akhan Akbulut Proceedings of the 12th IASTED international conference on signal and image processing . 2010

机译：基于隐马尔可夫模型的基于三通的土耳其语连续语音识别系统
5. American Sign Language recognition: Reducing the complexity of the task with phoneme-based modeling and parallel hidden Markov models. [D] . Vogler, Christian Philipp. 2003

机译：美国手语识别：通过基于音素的建模和并行隐马尔可夫模型，降低了任务的复杂性。
6. Enhancing Speech Recognition Using Improved Particle Swarm Optimization Based Hidden Markov Model [O] . Lokesh Selvaraj, Balakrishnan Ganesan -1

机译：基于隐马尔可夫模型的改进粒子群算法增强语音识别
7. Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems [O] . de Wet Febe, Kleynhans Neil, Van Compernolle Dirk, 2017

机译：资源匮乏语言的语音识别：隐马尔可夫模型系统中的数据共享
8. Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding [R] . Hogden, J. 1996

机译：改进隐马尔可夫模型：语音识别和语音编码的语义约束，最大似然方法

TRIPHONE BASED CONTINUOUS SPEECH RECOGNITION SYSTEM FOR TURKISH LANGUAGE USING HIDDEN MARKOV MODEL

摘要

著录项

相似文献

相关主题

期刊订阅