首页> 外文会议>International conference on Asian language processing >Neural Machine Translation for Cebuano to Tagalog with Subword Unit Translation
【24h】

Neural Machine Translation for Cebuano to Tagalog with Subword Unit Translation

机译:Cebuano到他加禄语的神经机器翻译及子词单位翻译

获取原文

摘要

The Philippines is an archipelago composed of 7, 641 different islands with more than 150 different languages. This linguistic differences and diversity, though may be seen as a beautiful feature, have contributed to the difficulty in the promotion of educational and cultural development of different domains in the country. An effective machine translation system solely dedicated to cater Philippine languages will surely help bridge this gap. In this research work, a never before applied approach for language translation to a Philippine language was used for a Cebuano to Tagalog translator. A Recurrent Neural Network was used to implement the translator using OpenNMT sequence modeling tool in TensorFlow. The performance of the translation was evaluated using the BLEU Score metric. For the Cebuano to Tagalog translation, BLEU produced a score of 20.01. A subword unit translation for verbs and copyable approach was performed where commonly seen mistranslated words from the source to the target were corrected. The BLEU score increased to 22.87. Though slightly higher, this score still indicates that the translation is somehow understandable but is not yet considered as a good translation.
机译:菲律宾是一个群岛,由7个,641个不同的岛屿以及150多种不同的语言组成。这种语言上的差异和多样性虽然可以看作是一个美丽的特征,但却加剧了在该国促进不同领域的教育和文化发展的困难。专门致力于迎合菲律宾语言的有效机器翻译系统肯定会帮助弥合这一差距。在这项研究工作中,从Cebuano到他加禄语的翻译人员使用了从未应用过的将语言翻译成菲律宾语言的方法。使用TensorFlow中的OpenNMT序列建模工具使用递归神经网络来实现翻译器。使用BLEU评分标准评估翻译的效果。对于Cebuano到他加禄语的翻译,BLEU的得分为20.01。进行了动词和可复制方法的亚词单位翻译,其中纠正了从源到目标的常见误译词。 BLEU分数提高到22.87。尽管分数稍高,但该分数仍表明该翻译是可以理解的,但尚未被认为是很好的翻译。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号