首页> 外文会议>German Conference on Pattern Recognition >DynGraph: Visual Question Answering via Dynamic Scene Graphs
【24h】

DynGraph: Visual Question Answering via Dynamic Scene Graphs

机译:Dyngraph:通过动态场景图答应答的视觉问题

获取原文

摘要

Due to the rise of deep learning, reasoning across various domains, such as vision, language, robotics, and control, has seen major progress in recent years. A popular benchmark for evaluating models for visual reasoning is Visual Question Answering (VQA), which aims at answering questions about a given input image by joining the two modalities: (1) the text representing the question, as well as, (2) the visual information extracted from the input image. In this work, we propose a structured approach for VQA that is based on dynamic graphs learned automatically from the input. Unlike the common approach for VQA that relies on an attention mechanism applied on a cell-structured global embedding of the image, our model leverages the rich structure in the image depicted in the object instances and their interaction. In our model, nodes in the graph correspond to object instances present in the image while the edges represent relations among them. Our model automatically constructs the scene graph and attends to the relations among the nodes to answer the given question. Hence, our model can be trained end-to-end and it does not require additional training labels in the form of predefined graphs or relations. We demonstrate the effectiveness of our approach on the challenging open-ended Visual Genome benchmark for VQA.
机译:由于深入学习的兴起,近年来,各个领域的推理,如愿景,语言,机器人和控制,近年来都有重大进展。一种流行的基准,用于评估视觉推理模型是视觉问题的应答(VQA),其目的在于通过加入两个模态来应答关于给定输入图像的问题:(1)代表问题的文本,以及(2)从输入图像中提取的可视信息。在这项工作中,我们提出了一种基于从输入自动学习的动态图形的VQA的结构化方法。与VQA的共同方法不同,依赖于应用于图像的细胞结构全局嵌入的注意机制,我们的模型利用了对象实例中描绘的图像中的丰富结构及其交互。在我们的模型中,图中的节点对应于图像中存在的对象实例,而边缘代表它们之间的关系。我们的模型自动构建场景图,并参加节点之间的关系以回答给定的问题。因此,我们的模型可以训练结束于结束,它不需要以预定义的图形或关系的形式额外的训练标签。我们展示了我们对VQA挑战开放式视觉基因组基准的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号