Residual Stacking of RNNs for Neural Machine Translation

Abstract

To enhance Neural Machine Translation (NMT) models, several obvious approaches can be considered, such as enlarging the hidden size of the recurrent layers or stacking multiple RNN layers. Surprisingly, we observe that naively stacking RNNs in the decoder slows down training and degrades performance. In this paper, we demonstrate that applying residual connections across the depth of stacked RNNs helps optimization; we refer to this as residual stacking. In empirical evaluation, residual stacking of decoder RNNs gives superior results compared with other ways of enhancing the model under a fixed parameter budget. Our systems submitted to WAT2016 are based on an ensemble of NMT models with residual stacking in the decoder. To further improve performance, we also try various methods of system combination in our experiments.
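To illustrate the core idea, the following is a minimal, hypothetical sketch (in PyTorch, not the authors' code) of residual connections applied across a stack of decoder RNN layers: each layer's output is added to its input, so the stack stays easy to optimize as depth grows. The class name, choice of GRU layers, and hyperparameters are illustrative assumptions, not details from the paper.

```python
import torch
import torch.nn as nn

class ResidualStackedGRU(nn.Module):
    """Stack of single-layer GRUs with a residual (identity) connection
    around each layer. Illustrative sketch only."""

    def __init__(self, hidden_size: int, num_layers: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(
            [nn.GRU(hidden_size, hidden_size, batch_first=True)
             for _ in range(num_layers)]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, hidden_size)
        for rnn in self.layers:
            out, _ = rnn(x)
            x = x + out  # residual connection across the layer's depth
        return x

if __name__ == "__main__":
    decoder_stack = ResidualStackedGRU(hidden_size=256, num_layers=4)
    dummy = torch.randn(2, 10, 256)
    print(decoder_stack(dummy).shape)  # torch.Size([2, 10, 256])
```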
