首页> 外文会议> >Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring
【24h】

Language Modelling Approaches for Turkish Large Vocabulary Continuous Speech Recognition Based on Lattice Rescoring

机译:基于格记录的土耳其大词汇量连续语音识别语言建模方法

获取原文

摘要

In this paper, we have tried some language modelling approaches for Large Vocabulary Continuous Speech Recognition (LVCSR) of Turkish. The agglutinative nature of Turkish makes Turkish a challenging language in terms of speech recognition since it is impossible to include all possible words in the recognition lexicon. Therefore, instead of using words as recognition units, we use a data-driven sub-word approach called morphs. This method was previously applied to Finnish, Estonian and Turkish and promising recognition results were achieved compared to words as recognition units. In our database, we obtained Word Error Rates (WER) of 38.8% for the baseline word-based model and 33.9% for the baseline morph-based model. In addition, we tried some new methods. Recognition lattice outputs of each model were rescored with the root-based and root-class-based models for the word-based case and first morph-based model for the morph-based case. The word-root composition approach achieves a 0.5% increase in the recogni tion performance. However, other two approaches fail due to the non-robust estimates over the baseline models.
机译:在本文中,我们尝试了一些针对土耳其语的大词汇量连续语音识别(LVCSR)的语言建模方法。土耳其语的凝集特性使土耳其语在语音识别方面成为具有挑战性的语言,因为不可能在识别词典中包含所有可能的单词。因此,我们不是使用单词作为识别单位,而是使用一种称为词素的数据驱动子单词方法。该方法先前已应用于芬兰语,爱沙尼亚语和土耳其语,并且与单词作为识别单位相比,获得了可喜的识别结果。在我们的数据库中,对于基于单词的基线模型,我们获得了38.8%的单词错误率(WER),对于基于基于词素变化的模型,我们获得了33.9%的单词错误率。此外,我们尝试了一些新方法。对于基于单词的案例,使用基于词根和基于类别的模型对每个模型的识别晶格输出进行评分,而对于基于词素的案例,则使用基于词素的模型作为第一个模型。词根组合法可将识别性能提高0.5%。但是,由于对基线模型的估计不够可靠,因此其他两种方法均会失败。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号