Neurocomputing
Effectively training neural machine translation models with monolingual data


Abstract

Improving neural machine translation (NMT) models with monolingual data has attracted increasing interest, and back-translation for monolingual data augmentation (Sennrich et al., 2016) has recently emerged as a promising direction. While naive back-translation improves translation performance substantially, we observe that its use of monolingual data is not fully effective, because traditional NMT models make no distinction between the true parallel corpus and the back-translated synthetic parallel corpus. This paper proposes a gate-enhanced NMT model that exploits monolingual data more effectively. The central idea is to separate the data flows of monolingual and parallel data into different channels via a carefully designed gate, which enables the model to apply different transformations according to the type of the input sequence, i.e., monolingual versus parallel data. Experiments on Chinese-English and English-German translation tasks show that our approach achieves substantial improvements over strong baselines, and that the gate-enhanced NMT model can utilize source-side and target-side monolingual data simultaneously. (C) 2018 Elsevier B.V. All rights reserved.
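The abstract gives no equations, so the following is only a toy numpy sketch of the gating idea it describes: a sigmoid gate, biased by a flag marking whether the input comes from the true parallel corpus or from back-translated synthetic data, mixes two different transformations of the hidden state. All names, shapes, and the exact gate parameterization here are illustrative assumptions, not the paper's actual architecture.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GatedEncoderSketch:
    """Toy gate routing true-parallel vs. back-translated (synthetic)
    inputs through different transformations, then mixing the results.
    Hypothetical parameterization for illustration only."""

    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        self.W_par = 0.1 * rng.normal(size=(dim, dim))   # channel for parallel data
        self.W_mono = 0.1 * rng.normal(size=(dim, dim))  # channel for synthetic data
        self.w_gate = 0.1 * rng.normal(size=dim)
        self.b_gate = 0.0

    def forward(self, h, is_synthetic):
        # The gate depends on the hidden state plus a data-type bias, so the
        # model can transform the two kinds of corpora differently instead of
        # treating them identically as a standard NMT model would.
        type_bias = 2.0 if is_synthetic else -2.0
        g = sigmoid(h @ self.w_gate + self.b_gate + type_bias)
        return g * np.tanh(h @ self.W_mono) + (1.0 - g) * np.tanh(h @ self.W_par)
```

Feeding the same hidden state through both channels shows the effect: the same input vector yields different representations depending on whether it is flagged as synthetic or true-parallel data.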
