Comparing rule-based and data-driven approaches to Spanish-to-Basque machine translation

Abstract

In this paper, we compare rule-based and data-driven approaches in the context of Spanish-to-Basque machine translation (MT). The rule-based system we consider was developed specifically for Spanish-to-Basque machine translation and is tuned to this language pair. In contrast, the data-driven system we use is generic and was not designed to deal with Basque. Spanish-to-Basque machine translation is a challenge for data-driven approaches for at least two reasons. First, there is a lack of bilingual data on which a data-driven MT system can be trained. Second, Basque is a morphologically rich, agglutinative language, and translating into Basque requires generating a large amount of morphological information, a difficult task for a generic system not tuned to Basque. We present the results of a series of experiments on two different corpora, one “in-domain” and the other “out-of-domain” with respect to the data-driven system. We show that n-gram-based automatic evaluation and edit-distance-based human evaluation yield two different sets of results. According to BLEU, the data-driven system outperforms the rule-based system on the in-domain data, while according to the human evaluation, the rule-based approach achieves higher scores on both corpora.
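
The abstract contrasts n-gram-based scoring with edit-distance-based scoring. The sketch below is only an illustration of how the two kinds of measure can disagree on the same output; it is not the paper's evaluation setup. It computes a clipped n-gram precision (the building block of BLEU, without the brevity penalty or the geometric mean over n-gram orders) and a word-level Levenshtein distance. The function names and the toy tokens are hypothetical, and the exact human-evaluation metric used in the paper is not specified in this abstract.

```python
from collections import Counter


def ngram_precision(hypothesis, reference, n):
    """Clipped n-gram precision of a tokenized hypothesis against one reference."""
    hyp = Counter(tuple(hypothesis[i:i + n]) for i in range(len(hypothesis) - n + 1))
    ref = Counter(tuple(reference[i:i + n]) for i in range(len(reference) - n + 1))
    total = sum(hyp.values())
    if total == 0:
        return 0.0
    # Each hypothesis n-gram is credited at most as often as it occurs in the reference.
    matched = sum(min(count, ref[gram]) for gram, count in hyp.items())
    return matched / total


def word_edit_distance(hypothesis, reference):
    """Word-level Levenshtein distance (insert, delete, substitute each cost 1)."""
    previous = list(range(len(reference) + 1))
    for i, hyp_tok in enumerate(hypothesis, start=1):
        current = [i]
        for j, ref_tok in enumerate(reference, start=1):
            cost = 0 if hyp_tok == ref_tok else 1
            current.append(min(previous[j] + 1,          # delete hyp_tok
                               current[j - 1] + 1,       # insert ref_tok
                               previous[j - 1] + cost))  # match or substitute
        previous = current
    return previous[-1]


# Toy example: swapping two adjacent tokens keeps every unigram
# but breaks most bigrams, while costing only two word edits.
hyp = ["a", "b", "c", "d"]
ref = ["a", "b", "d", "c"]
print(ngram_precision(hyp, ref, 1), ngram_precision(hyp, ref, 2), word_edit_distance(hyp, ref))
```

In this toy case the hypothesis keeps all of its unigrams, loses two of its three bigrams, and needs two word substitutions to match the reference, so the n-gram view and the edit view weight the same error differently.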