首页> 外文会议>European conference on computer vision workshops >Knowing Where to Look? Analysis on Attention of Visual Question Answering System
【24h】

Knowing Where to Look? Analysis on Attention of Visual Question Answering System

机译:知道去哪里看?视觉问答系统的注意度分析

获取原文

摘要

Attention mechanisms have been widely used in Visual Question Answering (VQA) solutions due to their capacity to model deep cross-domain interactions. Analyzing attention maps offers us a perspective to find out limitations of current VQA systems and an opportunity to further improve them. In this paper, we select two state-of-the-art VQA approaches with attention mechanisms to study their robustness and disadvantages by visualizing and analyzing their estimated attention maps. We find that both methods are sensitive to features, and simultaneously, they perform badly for counting and multi-object related questions. We believe that the findings and analytical method will help researchers identify crucial challenges on the way to improve their own VQA systems.
机译:注意机制由于能够对深度跨域交互进行建模,因此已广泛用于视觉问题解答(VQA)解决方案中。分析注意力图为我们提供了一个发现当前VQA系统局限性的视角,并为进一步改善它们提供了机会。在本文中,我们选择两种具有关注机制的最新VQA方法,通过可视化和分析其估计的关注图来研究其健壮性和缺点。我们发现这两种方法都对特征敏感,同时,它们在计数和与多对象相关的问题上表现不佳。我们相信,这些发现和分析方法将有助于研究人员确定改进自身VQA系统的关键挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号