首页> 外文会议>International Conference on Computer, Communication and Signal Processing >Improving the Performance of Low-Resource SMT Using Neural-Inspired Sentence Generator
【24h】

Improving the Performance of Low-Resource SMT Using Neural-Inspired Sentence Generator

机译:使用神经启发句子发生器提高低资源SMT的性能

获取原文

摘要

Machine translation (MT) is an important aspect of natural language processing (NLP) which uses bilingual data set and other language assets to translate text from a source language to text in a target language. The widely used statistical machine translation (SMT) system mainly relies on translation memories and glossaries to learn language pattern and translation rules. However the usage of these SMT systems under low-resource conditions remain a challenge. These SMT systems have a higher output quality when trained using domain specific training data since the texts belonging to same domain follow same pattern or usage of words. The current paper aims to develop and improve the performance of an SMT system under low-resource conditions for translation of health domain specific text from Tamil-to-English and English-to-Tamil. The translation quality of the machine translation system is improved using tuning based on minimum error rate training (MERT) and a novel neural-inspired sentence generator as a post-processor. The quality of translation and its performance analysis is evaluated in terms of the bilingual language understudy (BLEU) score and translation edit rate (TER).
机译:机器翻译(MT)是自然语言处理(NLP)的一个重要方面,它使用双语数据集和其他语言资产来将文本从源语言转换为目标语言中的文本。广泛使用的统计机器翻译(SMT)系统主要依赖于翻译记忆和词汇表来学习语言模式和翻译规则。然而,在低资源条件下使用这些SMT系统仍然是一个挑战。这些SMT系统在使用域特定培训数据训练时具有更高的输出质量,因为属于同一域的文本遵循相同的模式或单词的使用。目前的论文旨在在低资源条件下开发和改善SMT系统的性能,从泰米尔到英语和英语到泰米尔的健康域特定文本翻译。使用基于最小错误率训练(MERT)和新颖的神经启发句子发生器作为后处理器的调谐,改善了机器翻译系统的翻译质量。在双语语言升值(BLEU)评分和翻译编辑率(TER)方面评估翻译质量及其性能分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号