Conference on Machine Translation; Annual Meeting of the Association for Computational Linguistics

Transformer-based Automatic Post-Editing Model with Joint Encoder and Multi-source Attention of Decoder


Abstract

This paper describes POSTECH's submission to the WMT 2019 shared task on Automatic Post-Editing (APE). We propose a new multi-source APE model by extending Transformer. The main contributions of our study are that we 1) reconstruct the encoder to generate, in addition to the conventional src encoding, a joint representation of the translation (mt) and its src context, and 2) propose two types of multi-source attention layers in the decoder that compute attention between the two encoder outputs and the decoder state. Furthermore, we train our model with various teacher-forcing ratios to alleviate exposure bias. Finally, we adopt the ensemble technique across variations of our model. Experiments on the WMT19 English-German APE data set show improvements in terms of both TER and BLEU scores over the baseline. Our primary submission achieves -0.73 in TER and +1.49 in BLEU compared to the baseline, and ranks second among all submitted systems.
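
The abstract only sketches the architecture, so the following is a minimal, hypothetical PyTorch sketch of what a decoder layer with two multi-source cross-attention sub-layers could look like: one attending over the conventional src encoding and one over the joint (mt, src) representation. The class name MultiSourceDecoderLayer, all hyperparameters, and the sequential ordering of the two attention blocks are illustrative assumptions, not the authors' exact design.

```python
import torch
import torch.nn as nn

class MultiSourceDecoderLayer(nn.Module):
    """Illustrative decoder layer with two cross-attention blocks:
    one over the src encoding and one over the joint (mt, src) encoding.
    This is a sketch under assumptions, not the submitted system."""

    def __init__(self, d_model=512, nhead=8, dim_ff=2048, dropout=0.1):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, nhead, dropout=dropout, batch_first=True)
        self.src_attn = nn.MultiheadAttention(d_model, nhead, dropout=dropout, batch_first=True)
        self.joint_attn = nn.MultiheadAttention(d_model, nhead, dropout=dropout, batch_first=True)
        self.ff = nn.Sequential(nn.Linear(d_model, dim_ff), nn.ReLU(), nn.Linear(dim_ff, d_model))
        self.norms = nn.ModuleList(nn.LayerNorm(d_model) for _ in range(4))
        self.dropout = nn.Dropout(dropout)

    def forward(self, tgt, src_enc, joint_enc, tgt_mask=None):
        # 1) masked self-attention over the post-edit prefix
        x, _ = self.self_attn(tgt, tgt, tgt, attn_mask=tgt_mask)
        tgt = self.norms[0](tgt + self.dropout(x))
        # 2) cross-attention over the source-sentence encoding
        x, _ = self.src_attn(tgt, src_enc, src_enc)
        tgt = self.norms[1](tgt + self.dropout(x))
        # 3) cross-attention over the joint (mt conditioned on src) encoding
        x, _ = self.joint_attn(tgt, joint_enc, joint_enc)
        tgt = self.norms[2](tgt + self.dropout(x))
        # 4) position-wise feed-forward
        tgt = self.norms[3](tgt + self.dropout(self.ff(tgt)))
        return tgt

# Toy shapes: batch=2, src length=7, mt length=6, target prefix length=5.
layer = MultiSourceDecoderLayer()
src_enc = torch.randn(2, 7, 512)
joint_enc = torch.randn(2, 6, 512)   # joint representation of mt and its src context
tgt = torch.randn(2, 5, 512)
print(layer(tgt, src_enc, joint_enc).shape)  # torch.Size([2, 5, 512])
```

A cascaded design like this (attend to src, then to the joint representation) is only one of several ways to combine two encoder memories; the paper's two proposed multi-source attention variants may combine them differently.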
