Second Conference on Machine Translation

Biasing Attention-Based Recurrent Neural Networks Using External Alignment Information



Abstract

This work explores extending attention-based neural models to include alignment information as input. We modify the attention component to have dependence on the current source position. The attention model is then used as a lexical model together with an additional alignment model to generate translation. The attention model is trained using external alignment information, and it is applied in decoding by performing beam search over the lexical and alignment hypotheses. The alignment model is used to score these alignment candidates. We demonstrate that the attention layer is capable of using the alignment information to improve over the baseline attention model that uses no such alignments. Our experiments are performed on two tasks: WMT 2016 English→Romanian and WMT 2017 German→English.
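The abstract describes biasing the attention component toward a source position supplied by an external alignment. The paper's exact parameterization is not reproduced here; the sketch below (function names, and the additive log-domain bias mechanism, are illustrative assumptions) shows one simple way attention weights can be made to depend on an externally aligned source position.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - x.max())
    return e / e.sum()

def biased_attention(scores, aligned_pos, bias=2.0):
    """Attention weights biased toward an externally aligned source position.

    scores      -- raw attention energies over source positions (1-D array)
    aligned_pos -- source position given by the external alignment
    bias        -- additive log-domain boost (hypothetical strength parameter)
    """
    biased = scores.astype(float).copy()
    biased[aligned_pos] += bias  # boost the aligned position before normalizing
    return softmax(biased)

# The aligned position (index 2) receives the largest attention weight.
weights = biased_attention(np.array([0.1, 0.5, 0.2, 0.0]), aligned_pos=2)
```

In decoding, the paper performs beam search over both lexical and alignment hypotheses, with a separate alignment model scoring the alignment candidates; the bias above only illustrates the source-position dependence of the attention component.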

Bibliographic Information

  • Venue: Copenhagen (DK)
  • Authors

    Tamer Alkhouli; Hermann Ney;

  • Affiliation

    Human Language Technology and Pattern Recognition Group, Computer Science Department, RWTH Aachen University, D-52056 Aachen, Germany (both authors)

  • Format: PDF
  • Language: English

