Addressing Word-order Divergence in Multilingual Neural Machine Translation for Extremely Low Resource Languages

机译：解决极少资源语言的多语言神经机器翻译中的字序差异

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Transfer learning approaches for Neural Machine Translation (NMT) trains a NMT model on an assisting language-target language pair (parent model) which is later fine-tuned for the source language-target language pair of interest (child model), with the target language being the same. In many cases, the assisting language has a different word order from the source language. We show that divergent word order adversely limits the benefits from transfer learning when little to no parallel corpus between the source and target language is available. To bridge this divergence, we propose to pre-order the assisting language sentences to match the word order of the source language and train the parent model. Our experiments on many language pairs show that bridging the word order gap leads to major improvements in the translation quality in extremely low-resource scenarios.

机译：神经机器翻译（NMT）的转移学习方法在辅助语言-目标语言对（父模型）上训练NMT模型，随后针对目标语言对（目标模型）对源语言-目标语言对进行了微调。语言是一样的。在许多情况下，辅助语言的词序与源语言的词序不同。我们表明，当源语言和目标语言之间几乎没有平行语料库时，发散的单词顺序不利地限制了迁移学习的好处。为了弥合这种差异，我们建议对辅助语言句子进行预排序以匹配源语言的词序并训练父模型。我们在许多语言对上的实验表明，在极少资源的情况下，弥合单词顺序差距可以大大改善翻译质量。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2019年|3868-3873|共6页
会议地点
作者
Rudra Murthy V; Anoop Kunchukuttan; Pushpak Bhattacharyya;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Extremely low-resource neural machine translation for Asian languages [J] . Rubino Raphael, Marie Benjamin, Dabre Raj, Machine translation . 2020,第4期

机译：极低资源的神经机用于亚洲语言翻译
2. A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages [J] . Zennaki O., Semmar N., Besacier L. Natural language engineering . 2019,第PTa1期

机译：用于诱导多语言资源的神经方法和用于低资源语言的自然语言处理工具
3. Neural machine translation of low-resource languages using SMT phrase pair injection [J] . Sukanta Sen, Mohammed Hasanuzzaman, Asif Ekbal, Natural language engineering . 2021,第Pta3期

机译：使用SMT短语对注射的低资源语言的神经机翻译
4. Addressing Word-order Divergence in Multilingual Neural Machine Translation for Extremely Low Resource Languages [C] . Rudra Murthy V, Anoop Kunchukuttan, Pushpak Bhattacharyya Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2019

机译：解决极其低资源语言的多语种神经机翻译中的字阶分解
5. Turkic Interlingua: A Case Study of Machine Translation in Low-Resource Languages [D] . Mirzakhalov, Jamshidbek. 2021

机译：Turikic Interlingua：一种低资源语言机器翻译的案例研究
6. Pseudotext Injection and Advance Filtering of Low-Resource Corpus for Neural Machine Translation [O] . Michael Adjeisah, Guohua Liu, Douglas Omwenga Nyabuga, 2021

机译：神经电机翻译低资源语料的假义注射和预先滤波
7. Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages [O] . Rudra Murthy, Anoop Kunchukuttan, Pushpak Bhattacharyya 2019

机译：解决极其低资源语言的多语种神经机翻译中的字阶分解

Addressing Word-order Divergence in Multilingual Neural Machine Translation for Extremely Low Resource Languages

摘要

著录项

相似文献

相关主题

期刊订阅