首页> 外文会议> >Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring

【24h】

Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring

机译：基于格记录的土耳其大词汇量连续语音识别语言建模方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we have tried some language modelling approaches for Large Vocabulary Continuous Speech Recognition (LVCSR) of Turkish. The agglutinative nature of Turkish makes Turkish a challenging language in terms of speech recognition since it is impossible to include all possible words in the recognition lexicon. Therefore, instead of using words as recognition units, we use a data-driven sub-word approach called morphs. This method was previously applied to Finnish, Estonian and Turkish and promising recognition results were achieved compared to words as recognition units. In our database, we obtained Word Error Rates (WER) of 38.8% for the baseline word-based model and 33.9% for the baseline morph-based model. In addition, we tried some new methods. Recognition lattice outputs of each model were rescored with the root-based and root-class-based models for the word-based case and first morph-based model for the morph-based case. The word-root composition approach achieves a 0.5% increase in the recogni tion performance. However, other two approaches fail due to the non-robust estimates over the baseline models.

机译：在本文中，我们尝试了一些针对土耳其语的大词汇量连续语音识别（LVCSR）的语言建模方法。土耳其语的凝集特性使土耳其语在语音识别方面成为具有挑战性的语言，因为不可能在识别词典中包含所有可能的单词。因此，我们不是使用单词作为识别单位，而是使用一种称为词素的数据驱动子单词方法。该方法先前已应用于芬兰语，爱沙尼亚语和土耳其语，并且与单词作为识别单位相比，获得了可喜的识别结果。在我们的数据库中，对于基于单词的基线模型，我们获得了38.8％的单词错误率（WER），对于基于基于词素变化的模型，我们获得了33.9％的单词错误率。此外，我们尝试了一些新方法。对于基于单词的案例，使用基于词根和基于类别的模型对每个模型的识别晶格输出进行评分，而对于基于词素的案例，则使用基于词素的模型作为第一个模型。词根组合法可将识别性能提高0.5％。但是，由于对基线模型的估计不够可靠，因此其他两种方法均会失败。

著录项

来源
《》|2006年|P.1-4|共4页
会议地点
作者
Arisoy; E.; Saraclar; M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition [J] . Hori T., Hori C., Minami Y., IEEE transactions on audio, speech and language processing . 2007,第4期

机译：高效的基于WFST的单遍解码，具有即时假设，可极大地记录词汇量，并能连续语音识别
2. A unified language model for large vocabulary continuous speech recognition of Turkish [J] . Ebru Ansoy, Helin Dutagaci, Levent M. Arslan Signal processing . 2006,第10期

机译：土耳其语大词汇量连续语音识别的统一语言模型
3. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J] . Bert Reveil, Kris Demuynck, Jean-Pierre Martens Computer speech and language . 2014,第1期

机译：一种改进的两阶段混合语言模型方法，用于处理大词汇量连续语音识别中的词汇外单词
4. Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring [C] . Arisoy E., Saraclar M. IEEE Signal Processing and Communications Applications . 2006

机译：土耳其大型词汇持续语音识别语言建模方法
5. Modeling lexical tones for Mandarin large vocabulary continuous speech recognition. [D] . Lei, Xin. 2006

机译：为普通话大词汇量连续语音识别建模词汇声调。
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O] . Edvin Pakoci, Branislav Popović, Darko Pekar 2019

机译：在塞尔维亚大型词汇语音识别的语言建模中使用形态学数据
7. Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition [O] . Kumar, Shankar, Nirschl, Michael, Holtmann-Rice, Daniel, 2017

机译：长期短期记忆语言模型的格子重新策略在语音识别中

Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring

摘要

著录项

相似文献

相关主题

期刊订阅