首页> 外文会议>International Conference of Computer and Information Technology >A Corpus Based N-gram Hybrid Approach of Bengali to English Machine Translation
【24h】

A Corpus Based N-gram Hybrid Approach of Bengali to English Machine Translation

机译:基于孟加拉语料库的孟加拉语N-gram混合方法进行英语机器翻译

获取原文

摘要

Machine translation means automatic translation which is performed using computer software. There are several approaches to machine translation, some of them need extensive linguistic knowledge while others require enormous statistical calculations. This paper presents a hybrid method, integrating corpus based approach and statistical approach for translating Bengali sentences into English with the help of N-gram language model. The corpus based method finds the corresponding target language translation of sentence fragments, selecting the best match text from the bilingual corpus to acquire knowledge while the N-gram model rearranges the sentence constituents to get an accurate translation without employing external linguistic rules. A variety of Bengali sentences, including various structures and verb tenses are considered to translate through the new system. The performance of the proposed system is evaluated in terms of adequacy, fluency, WER, and BLEU score. The assessment scores are compared with other conventional approaches as well as with Google Translate, a well-known free machine translation service by Google. It has been found that experimental results of the work provide higher scores over Google Translate and other methods with less computational cost.
机译:机器翻译是指使用计算机软件执行的自动翻译。机器翻译有几种方法,其中一些需要广泛的语言知识,而另一些则需要大量的统计计算。本文提出了一种混合方法,结合基于语料库的方法和统计方法,借助N-gram语言模型将孟加拉语句子翻译成英语。基于语料库的方法找到句子片段的相应目标语言翻译,从双语语料库中选择最佳匹配文本以获取知识,而N-gram模型重新排列句子成分以获得准确的翻译,而无需采用外部语言规则。各种各样的孟加拉语句子,包括各种结构和动词时态,都被认为可以通过新系统进行翻译。所提出系统的性能通过充分性,流畅性,WER和BLEU得分进行评估。评估分数将与其他常规方法以及Google Translate进行比较,Google Translate是Google著名的免费机器翻译服务。已经发现,该工作的实验结果比Google Translate和其他方法具有更高的分数,并且计算成本更低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号