International Conference on Language Resources and Evaluation

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation



Abstract

Neural machine translation (NMT) needs large parallel corpora for state-of-the-art translation quality. Low-resource NMT is typically addressed by transfer learning, which leverages large monolingual or parallel corpora for pre-training. Monolingual pre-training approaches such as MASS (MAsked Sequence to Sequence) are extremely effective in boosting NMT quality for languages with small parallel corpora. However, they do not account for linguistic information obtained using syntactic analyzers, which is known to be invaluable for several Natural Language Processing (NLP) tasks. To this end, we propose JASS, Japanese-specific Sequence to Sequence, as a novel pre-training alternative to MASS for NMT involving Japanese as the source or target language. JASS is joint BMASS (Bunsetsu MASS) and BRSS (Bunsetsu Reordering Sequence to Sequence) pre-training, which focuses on Japanese linguistic units called bunsetsus. In our experiments on ASPEC Japanese-English and News Commentary Japanese-Russian translation, we show that JASS can give results that are competitive with, if not better than, those given by MASS. Furthermore, we show for the first time that joint MASS and JASS pre-training gives results that significantly surpass the individual methods, indicating their complementary nature. We will release our code, pre-trained models, and bunsetsu-annotated data as resources for researchers to use in their own NLP tasks.
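As a concrete illustration of the two objectives, the following minimal Python sketch builds BMASS- and BRSS-style training pairs from a sentence assumed to be pre-segmented into bunsetsu by an external syntactic analyzer. The function names, the mask token, and the use of a random permutation for BRSS are illustrative assumptions, not the authors' released implementation (the paper uses its own reordering scheme).

    import random

    MASK = "[MASK]"

    def bmass_example(bunsetsu, mask_ratio=0.5, seed=None):
        """Build one BMASS-style (encoder_input, decoder_target) pair.

        Unlike token-level MASS, the masked span is a contiguous run of
        whole bunsetsu, so mask boundaries never split a linguistic unit.
        """
        rng = random.Random(seed)
        n = len(bunsetsu)
        span = max(1, int(n * mask_ratio))   # number of bunsetsu to mask
        start = rng.randint(0, n - span)     # span start, chosen uniformly
        masked = bunsetsu[start:start + span]
        source = bunsetsu[:start] + [MASK] * span + bunsetsu[start + span:]
        # Joined with spaces only for readability; Japanese bunsetsu
        # would normally be concatenated without delimiters.
        return " ".join(source), " ".join(masked)

    def brss_example(bunsetsu, seed=None):
        """Build one BRSS-style pair: a bunsetsu-reordered sentence as
        input, the original order as the reconstruction target.
        (Random permutation here; the paper's reordering is rule-based.)
        """
        rng = random.Random(seed)
        shuffled = bunsetsu[:]
        rng.shuffle(shuffled)
        return " ".join(shuffled), " ".join(bunsetsu)

    if __name__ == "__main__":
        # A sentence pre-segmented into bunsetsu (segmenter not shown).
        sent = ["彼は", "新しい", "本を", "買った"]
        print(bmass_example(sent, seed=0))
        print(brss_example(sent, seed=0))

In actual pre-training, each pair would presumably be fed to a standard encoder-decoder model as in MASS, with the decoder trained to emit the masked bunsetsu span (BMASS) or to restore the original bunsetsu order (BRSS).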
