【24h】

Sequence Generation with Target Attention

机译:具有目标注意力的序列生成

获取原文

摘要

Source-target attention mechanism (briefly, source attention) has become one of the key components in a wide range of sequence generation tasks, such as neural machine translation, image caption, and open-domain dialogue generation. In these tasks, the attention mechanism, typically in control of information flow from the encoder to the decoder, enables to generate every component in the target sequence relying on different source components. While source attention mechanism has attracted many research interests, few of them turn eyes to if the generation of target sequence can additionally benefit from attending back to itself, which however is intuitively motivated by the nature of attention. To investigate the question, in this paper, we propose a new target-target attention mechanism (briefly, target attention). Along the progress of generating target sequence, target attention mechanism takes into account the relationship between the component to generate and its preceding context within the target sequence, such that it can better keep the coherent consistency and improve the readability of the generated sequence. Furthermore, it complements the information from source attention so as to further enhance semantic adequacy. After designing an effective approach to incorporate target attention in encoder-decoder framework, we conduct extensive experiments on both neural machine translation and image caption. Experimental results clearly demonstrate the effectiveness of our design of integrating both source and target attention for sequence generation tasks.
机译:源目标关注机制(简称源关注)已成为一系列序列生成任务(如神经机器翻译,图像标题和开放域对话生成)中的关键组件之一。在这些任务中,关注机制(通常用于控制从编码器到解码器的信息流)能够根据不同的源组件生成目标序列中的每个组件。尽管源代码关注机制吸引了许多研究兴趣,但很少有人关注目标序列的生成是否可以额外受益于自身参与,然而,这是关注本质的直观驱动。为了研究这个问题,在本文中,我们提出了一种新的目标-目标注意机制(简称目标注意)。随着目标序列生成的进展,目标注意力机制考虑了要生成的组件与其在目标序列中先前上下文之间的关系,从而可以更好地保持连贯的一致性并提高生成序列的可读性。此外,它补充了来自源关注点的信息,从而进一步增强了语义上的适当性。在设计了一种将目标注意力纳入编码器-解码器框架的有效方法之后,我们对神经机器翻译和图像字幕进行了广泛的实验。实验结果清楚地证明了我们在序列生成任务中整合源和目标注意力的设计的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号