首页> 外文期刊>ACM transactions on Asian language information processing >Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System
【24h】

Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System

机译:增强的基于词素的语言模型与不同的基于单词的语言模型的性能比较,以提高泰米尔语语音识别系统的性能

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a new technique of language modeling for a highly inflectional Dravidian language, Tamil. It aims to alleviate the main problems encountered in processing of Tamil language, like enormous vocabulary growth caused by the large number of different forms derived from one word. The size of the vocabulary was reduced by, decomposing the words into stems and endings and storing these sub word units (morphemes) in the vocabulary separately. A enhanced morpheme-based language model was designed for the inflectional language Tamil. The enhanced morpheme-based language model was trained on the decomposed corpus. The perplexity and Word Error Rate (WER) were obtained to check the efficiency of the model for Tamil speech recognition system. The results were compared with word-based bigram and trigram language models, distance based language model, dependency based language model and class based language model. From the results it was analyzed that the enhanced morpheme-based trigram model with Katz back-off smoothing effect improved the performance of the Tamil speech recognition system when compared to the word-based language models.
机译:本文介绍了一种高度折衷的德拉维语言泰米尔语的语言建模新技术。它旨在减轻泰米尔语处理过程中遇到的主要问题,例如由一个单词衍生出的大量不同形式导致的词汇量大量增加。通过将单词分解为词干和结尾并将这些子单词单元(词素)分别存储在词汇表中来减小词汇表的大小。一种增强的基于词素的语言模型被设计用于泰米尔语的屈折语言。增强的基于语素的语言模型是在分解后的语料库上训练的。获得了困惑度和单词错误率(WER)以检查该模型的泰米尔语语音识别系统的效率。将结果与基于单词的二元语言和三元组语言模型,基于距离的语言模型,基于依赖的语言模型和基于类的语言模型进行比较。从结果分析,与基于单词的语言模型相比,具有Katz后退平滑效果的增强的基于词素的trigram模型提高了泰米尔语语音识别系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号