Learning to Generate Word- and Phrase-Embeddings for Efficient Phrase-Based Neural Machine Translation

机译：学习生成词和短语嵌入，以实现基于短语的高效神经机器翻译

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural machine translation (NMT) often fails in one-to-many translation, e.g., in the translation of multi-word expressions, compounds, and collocations. To improve the translation of phrases, phrase-based NMT systems have been proposed; these typically combine word-based NMT with external phrase dictionaries or with phrase tables from phrase-based statistical MT systems. These solutions introduce a significant overhead of additional resources and computational costs. In this paper, we introduce a phrase-based NMT model built upon continuous-output NMT, in which the decoder generates embeddings of words or phrases. The model uses a fertility module, which guides the decoder to generate em-beddings of sequences of varying lengths. We show that our model learns to translate phrases better, performing on par with state of the art phrase-based NMT. Since our model does not resort to softmax computation over a huge vocabulary of phrases, its training time is about 112x faster than the baseline.

机译：神经机器翻译（NMT）通常在一对多翻译中失败，例如在多词表达，复合词和搭配词的翻译中。为了改进短语的翻译，已经提出了基于短语的NMT系统。这些通常将基于单词的NMT与外部短语词典或基于短语的统计MT系统中的短语表结合使用。这些解决方案带来了额外资源和计算成本的巨大开销。在本文中，我们介绍了一种基于短语的NMT模型，该模型建立在连续输出NMT之上，其中解码器生成单词或短语的嵌入。该模型使用生育模块，该模块指导解码器生成长度可变的序列的嵌入。我们证明了我们的模型可以更好地翻译短语，与基于词组的NMT的表现相当。由于我们的模型没有在庞大的词组词汇上求助于softmax计算，因此其训练时间比基线快约112倍。

著录项

来源
《Workshop on neural generation and translation》|2019年|241-248|共8页
会议地点
作者
Chan Young Park; Yulia Tsvetkov;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Post-editing neural machine translation versus phrase-based machine translation for English-Chinese [J] . Yanfang Jia, Michael Carl, Xiangling Wang Machine translation . 2019,第1a2期

机译：英译后的神经机器翻译与基于短语的机器翻译
2. Post-editing neural machine translation versus phrase-based machine translation for English-Chinese [J] . Yanfang Jia, Michael Carl, Xiangling Wang Machine translation . 2019,第1a2期

机译：编辑后神经机翻译与基于短语的机器翻译英语 - 中文翻译
3. Improving neural machine translation through phrase-based soft forced decoding [J] . Zhang Jingyi, Utiyama Masao, Sumita Eiichro, Machine translation . 2020,第1期

机译：通过基于短语的软强制解码改善神经电脑翻译
4. Neural-Based Machine Translation System Outperforming Statistical Phrase-Based Machine Translation for Low-Resource Languages [C] . Muskaan Singh, Ravinder Kumar, Inderveer Chana International Conference on Contemporary Computing . 2019

机译：低资源语言的基于神经的机器翻译系统胜过基于统计短语的机器翻译
5. Emerging Opportunities in Machine Learning Hardware Acceleration: From Advanced Neural Networks Implementation to Ultra-efficient Deep Learning Framework Using Next Generation Technology [D] . ?Cai, Ruizhe 2020

机译：机器学习硬件加速的新兴机会：从先进的神经网络实现，使用下一代技术实现超高效的深度学习框架
6. A Neural Machine Translation Model for Arabic Dialects That Utilises Multitask Learning (MTL) [O] . Laith H. Baniata, Seyoung Park, Seong-Bae Park 2018

机译：利用多任务学习（MTL）的阿拉伯语神经机器翻译模型
7. Phrase-Based Neural Unsupervised Machine Translation [O] . Guillaume Lample, Myle Ott, Alexis Conneau, 2018

机译：基于短语和神经无人监督的机器翻译

Learning to Generate Word- and Phrase-Embeddings for Efficient Phrase-Based Neural Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅