首页> 外文期刊>Signal processing >A unified language model for large vocabulary continuous speech recognition of Turkish
【24h】

A unified language model for large vocabulary continuous speech recognition of Turkish

机译:土耳其语大词汇量连续语音识别的统一语言模型

获取原文
获取原文并翻译 | 示例
       

摘要

We have designed a Turkish dictation system for newspaper content transcription application. Turkish is an agglutinative language with free word order. These characteristics of the language result in vocabulary explosion, large number of out-of-vocabulary (OOV) words and an increased complexity of w-gram language models in speech recognition when words are used as recognition units. In this paper, alternative language modeling units like "stems and endings", "stems and morphemes", and "syllables" are investigated instead of "words". These recognition units are compared in terms of vocabulary size, coverage, bigram perplexity and speech recognition performance. A combined model is proposed which aims to produce a balance between the OOV rate and the amount of phoneme sequence constraints on recognition units. The proposed model resulted in letter error rates (LER's) of approximately 28% for a speaker independent system and 20% for a speaker dependent system. These error rates are smaller compared to the traditional word-based model for newspaper content transcription application.
机译:我们为报纸内容转录应用程序设计了土耳其语听写系统。土耳其语是一种凝集性语言,带有自由词序。语言的这些特征会导致词汇爆炸,大量的语音(OOV)单词以及将单词用作识别单元时语音识别中w-gram语言模型的复杂性增加。在本文中,研究了替代语言建模单元,例如“词干和结尾”,“词干和词素”和“音节”,而不是“单词”。比较这些识别单元的词汇量,覆盖范围,二元困惑和语音识别性能。提出了一种组合模型,旨在在OOV速率和识别单元上的音素序列约束量之间取得平衡。所提出的模型对于独立于说话者的系统产生大约28%的字母错误率(LER),而对于独立于说话者的系统产生20%的字母错误率。与用于报纸内容转录应用的传统基于单词的模型相比,这些错误率较小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号