ACM transactions on Asian language information processing

Multi-Round Transfer Learning for Low-Resource NMT Using Multiple High-Resource Languages



Abstract

Neural machine translation (NMT) has made remarkable progress in recent years, but its performance suffers from a data sparsity problem, since large-scale parallel corpora are readily available only for high-resource languages (HRLs). Recently, transfer learning (TL) has been widely used in machine translation for low-resource languages (LRLs) and has become one of the main directions for addressing the data sparsity problem in low-resource NMT. In this setting, transfer learning is typically realized by initializing the low-resource (child) model with the parameters of a high-resource (parent) model. However, the original TL approach can neither make full use of multiple highly related HRLs nor let a child receive different parameters from the same parent. To exploit multiple HRLs effectively, we present a language-independent and straightforward multi-round transfer learning (MRTL) approach to low-resource NMT. In addition, to reduce the differences between high-resource and low-resource languages at the character level, we introduce a unified transliteration method for various language families whose languages are semantically and syntactically highly analogous to one another. Experiments on low-resource datasets show that our approaches are effective, significantly outperform state-of-the-art methods, and yield improvements of up to 5.63 BLEU points.
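The multi-round transfer described in the abstract can be pictured as a chain of parent-to-child initializations: train on the highest-resource pair first, copy its parameters into a fresh child, continue training on the next pair, and repeat until the low-resource pair. The sketch below is a hypothetical PyTorch illustration of that chain, not the authors' implementation; `TinyNMT`, `train_one_round`, and the toy data are stand-ins, and a single shared vocabulary across rounds is assumed (which a unified transliteration to a common character inventory would make plausible).

```python
# Hypothetical sketch of multi-round transfer learning (MRTL):
# each round's child model is initialized from the previous round's
# trained parent before training on its own language pair.
import torch
import torch.nn as nn

class TinyNMT(nn.Module):
    """Toy encoder-decoder standing in for a real NMT model."""
    def __init__(self, vocab_size=1000, dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.encoder = nn.GRU(dim, dim, batch_first=True)
        self.decoder = nn.GRU(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, src, tgt):
        _, h = self.encoder(self.embed(src))        # encode source
        dec, _ = self.decoder(self.embed(tgt), h)   # decode from final state
        return self.out(dec)

def train_one_round(model, batches, epochs=1, lr=1e-3):
    """Placeholder training loop for one (parent or child) language pair."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for src, tgt in batches:
            logits = model(src, tgt[:, :-1])        # teacher forcing
            loss = loss_fn(logits.reshape(-1, logits.size(-1)),
                           tgt[:, 1:].reshape(-1))
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model

def multi_round_transfer(rounds):
    """Chain of transfers: HRL pair 1 -> HRL pair 2 -> ... -> LRL pair.

    `rounds` is an ordered list of corpora, highest-resource first and
    the low-resource pair last; each child inherits all parameters from
    the previous round's trained parent.
    """
    parent = None
    for corpus in rounds:
        child = TinyNMT()
        if parent is not None:
            child.load_state_dict(parent.state_dict())  # parent -> child init
        parent = train_one_round(child, corpus)
    return parent

# Toy usage: three rounds on random data (real rounds would use, e.g.,
# two related HRL corpora followed by the LRL corpus).
toy = [(torch.randint(0, 1000, (8, 10)), torch.randint(0, 1000, (8, 11)))]
final_model = multi_round_transfer([toy, toy, toy])
```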
