Conference on Empirical Methods in Natural Language Processing

An Analysis of Encoder Representations in Transformer-Based Machine Translation

Abstract

The attention mechanism is a successful technique in modern NLP, especially in tasks like machine translation. The recently proposed Transformer network architecture is based entirely on attention mechanisms and achieves new state-of-the-art results in neural machine translation, outperforming other sequence-to-sequence models. However, so far not much is known about the internal properties of the model and the representations it learns to achieve that performance. To study this question, we investigate the information learned by the attention mechanism in Transformer models of varying translation quality. We assess the representations of the encoder by extracting dependency relations based on self-attention weights, perform four probing tasks to study how much syntactic and semantic information they capture, and test attention in a transfer learning scenario. Our analysis sheds light on the relative strengths and weaknesses of the various encoder representations. We observe that specific attention heads mark syntactic dependency relations, and we can also confirm that lower layers tend to learn more about syntax while higher layers tend to encode more semantics.
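The abstract mentions extracting dependency relations from self-attention weights. As a rough illustration of that idea (this is a minimal sketch, not the authors' released code; the helper name and the toy attention matrix are invented for the example), one can read the most-attended position of each token as its predicted syntactic head:

```python
import numpy as np

def attention_to_heads(attn, tokens):
    """Read a predicted syntactic head for each token off one attention
    head: the position it attends to most. Hypothetical helper, written
    for this sketch rather than taken from the paper's code."""
    # attn[i, j] is the softmax weight with which token i attends to token j.
    predicted = attn.argmax(axis=-1)
    return [(tokens[i], tokens[j]) for i, j in enumerate(predicted)]

# Toy 4-token example with a hand-crafted attention matrix.
tokens = ["The", "cat", "sleeps", "."]
attn = np.array([
    [0.1, 0.7, 0.1, 0.1],   # "The"    -> mostly "cat"
    [0.1, 0.1, 0.7, 0.1],   # "cat"    -> mostly "sleeps"
    [0.2, 0.2, 0.4, 0.2],   # "sleeps" -> mostly itself (root-like)
    [0.1, 0.1, 0.7, 0.1],   # "."      -> mostly "sleeps"
])
for dependent, head in attention_to_heads(attn, tokens):
    print(f"{dependent} -> {head}")
```

Comparing such argmax relations against a gold dependency parse, head by head and layer by layer, is one way to quantify how strongly individual attention heads mark syntactic relations.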
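The four probing tasks are likewise described only at a high level here. A minimal sketch of the usual probing setup, with random placeholder vectors standing in for frozen encoder states and an invented binary property label, might look like this:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Placeholder data: in a real probe, X would hold frozen encoder
# representations of sentences and y a linguistic property of each
# sentence (e.g. main-verb tense). Both are random stand-ins here.
X_train, y_train = rng.normal(size=(1000, 512)), rng.integers(0, 2, size=1000)
X_test, y_test = rng.normal(size=(200, 512)), rng.integers(0, 2, size=200)

# A simple linear classifier is trained on top of the fixed representations;
# its test accuracy is taken as a measure of how much of the property the
# representations encode (chance level is 0.5 for this random data).
probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("probing accuracy:", probe.score(X_test, y_test))
```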

