Sequence Generation with Target Attention

Abstract

The source-target attention mechanism (source attention, for short) has become a key component in a wide range of sequence generation tasks, such as neural machine translation, image captioning, and open-domain dialogue generation. In these tasks, the attention mechanism typically controls the flow of information from the encoder to the decoder, so that each component of the target sequence can be generated from a different set of source components. While source attention has attracted considerable research interest, little work has examined whether generating the target sequence can additionally benefit from attending back to the target sequence itself, even though the nature of attention makes this an intuitive idea. To investigate this question, we propose a new target-target attention mechanism (target attention, for short). As the target sequence is generated, target attention takes into account the relationship between the component about to be generated and its preceding context within the target sequence, so that the generated sequence stays more coherent and more readable. Furthermore, it complements the information from source attention and thereby further improves semantic adequacy. After designing an effective approach for incorporating target attention into the encoder-decoder framework, we conduct extensive experiments on both neural machine translation and image captioning. The experimental results demonstrate the effectiveness of integrating both source and target attention for sequence generation tasks.
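
The abstract does not give the exact formulation, so the following is only a minimal sketch of the general idea, under stated assumptions: plain dot-product scoring, and concatenation as the way to combine the two context vectors. The names `attend` and `decoder_contexts` are hypothetical and are not taken from the paper. At each decoding step the current decoder query attends once over the encoder states (source attention) and once over the decoder states produced so far (target attention).

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(query, states):
    """Dot-product attention (an assumed scoring function).

    Returns the attention weights over `states` and the weighted
    sum of the states (the context vector).
    """
    weights = softmax([dot(query, s) for s in states])
    dim = len(query)
    context = [sum(w * s[i] for w, s in zip(weights, states)) for i in range(dim)]
    return weights, context


def decoder_contexts(query, source_states, previous_target_states):
    """Combine source attention and target attention for one decoding step.

    Assumption: the two context vectors are simply concatenated; the
    paper may combine them differently.
    """
    src_weights, src_context = attend(query, source_states)
    if previous_target_states:
        tgt_weights, tgt_context = attend(query, previous_target_states)
    else:
        # First step: there is no preceding target context to attend to.
        tgt_weights, tgt_context = [], [0.0] * len(query)
    return src_context + tgt_context, src_weights, tgt_weights


# Usage: two encoder states, three previously generated decoder states.
source = [[1.0, 0.0], [0.0, 1.0]]
previous = [[0.5, 0.5], [1.0, 0.0], [0.0, 2.0]]
combined, src_w, tgt_w = decoder_contexts([1.0, 0.0], source, previous)
```

Because target attention only ever sees states that were generated before the current step, no masking is needed in this step-by-step form; a batched implementation would need a causal mask to enforce the same constraint.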