Conference on Empirical Methods in Natural Language Processing

Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation



Abstract

Most Neural Machine Translation (NMT) models are based on the sequence-to-sequence (Seq2Seq) model, an encoder-decoder framework equipped with an attention mechanism. However, the conventional attention mechanism treats the decoding at each time step equally, using the same matrix, which is problematic because the softness of the attention for different types of words (e.g., content words and function words) should differ. We therefore propose a new model with a mechanism called Self-Adaptive Control of Temperature (SACT), which controls the softness of attention by means of an attention temperature. Experimental results on Chinese-English and English-Vietnamese translation demonstrate that our model outperforms the baseline models, and analysis and a case study show that our model can attend to the most relevant elements in the source-side contexts and generate high-quality translations.
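The abstract does not give the exact formulation, but the core idea of temperature-scaled attention can be sketched as follows: a per-step scalar temperature, predicted from the decoder state, divides the attention logits before the softmax, so a small temperature concentrates attention on a few source words while a large one diverts it more evenly. The sketch below is a minimal, hypothetical PyTorch illustration of this idea; the scoring function, the `lam` bound, and all module names are assumptions for illustration, not the authors' published equations.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TemperatureAttention(nn.Module):
    """Illustrative sketch of self-adaptive attention temperature.

    A per-decoding-step temperature tau in [1/lam, lam] rescales the
    attention logits before the softmax: tau < 1 sharpens (concentrates)
    the distribution, tau > 1 softens (diverts) it. The parameterization
    in the SACT paper may differ from this sketch.
    """

    def __init__(self, hidden_size: int, lam: float = 4.0):
        super().__init__()
        self.score = nn.Linear(hidden_size * 2, 1)  # additive-style scoring (illustrative)
        self.temp_proj = nn.Linear(hidden_size, 1)  # predicts beta from the decoder state
        self.lam = lam

    def forward(self, dec_state, enc_outputs):
        # dec_state: (batch, hidden); enc_outputs: (batch, src_len, hidden)
        batch, src_len, hidden = enc_outputs.size()
        expanded = dec_state.unsqueeze(1).expand(-1, src_len, -1)
        logits = self.score(torch.cat([expanded, enc_outputs], dim=-1)).squeeze(-1)

        # beta in [-1, 1] via tanh; temperature tau = lam ** beta in [1/lam, lam]
        beta = torch.tanh(self.temp_proj(dec_state))  # (batch, 1)
        tau = self.lam ** beta                        # (batch, 1)

        # Dividing the logits by tau controls the softness of the attention weights.
        weights = F.softmax(logits / tau, dim=-1)     # (batch, src_len)
        context = torch.bmm(weights.unsqueeze(1), enc_outputs).squeeze(1)
        return context, weights
```

Because `tau` is produced by the network itself, the model can learn to sharpen attention when generating content words (which tend to align with specific source tokens) and soften it for function words, which is the behavior the abstract describes.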
