European Conference on Computer Vision

Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering



Abstract

We address the problem of Visual Question Answering (VQA), which requires joint image and language understanding to answer a question about a given photograph. Recent approaches have applied deep image captioning methods based on convolutional-recurrent networks to this problem, but have failed to model spatial inference. To remedy this, we propose a model we call the Spatial Memory Network and apply it to the VQA task. Memory networks are recurrent neural networks with an explicit attention mechanism that selects certain parts of the information stored in memory. Our Spatial Memory Network stores neuron activations from different spatial regions of the image in its memory, and uses attention to choose regions relevant for computing the answer. We propose a novel question-guided spatial attention architecture that looks for regions relevant to either individual words or the entire question, repeating the process over multiple recurrent steps, or "hops". To better understand the inference process learned by the network, we design synthetic questions that specifically require spatial inference and visualize the network's attention. We evaluate our model on two available visual question answering datasets and obtain improved results.
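
The multi-hop attention described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `spatial_attention_hops`, the dot-product scoring, and the additive query update are assumptions made for illustration, and all learned embeddings and projections are omitted. It only shows the general pattern of scoring stored region features against a question vector, taking a softmax over regions, and repeating the process for several hops.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def spatial_attention_hops(regions, question, n_hops=2):
    """Hypothetical sketch of question-guided attention over image regions.

    regions:  (R, d) array, one feature vector per spatial region (the "memory").
    question: (d,) array, an embedding of the question.
    Returns the final query vector and the attention weights from each hop.
    """
    query = question
    weights_per_hop = []
    for _ in range(n_hops):
        scores = regions @ query      # (R,) relevance of each region to the query
        weights = softmax(scores)     # attention distribution over regions
        attended = weights @ regions  # (d,) attention-weighted sum of region features
        query = query + attended      # assumed update: add the evidence to the query
        weights_per_hop.append(weights)
    return query, weights_per_hop

# Toy example: three regions with 2-D features and a question aligned with region 0.
regions = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
question = np.array([2.0, 0.0])
final_query, hops = spatial_attention_hops(regions, question, n_hops=2)
```

In this toy setup each entry of `hops` is a probability distribution over the three regions, which is the kind of quantity that could be rendered as a heat map to visualize where attention falls.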
