Robustness Analysis of Visual Question Answering Models by Basic Questions

Abstract

Visual Question Answering (VQA) models should be both robust and accurate. Unfortunately, most current VQA research focuses only on accuracy because there is a lack of proper methods for measuring the robustness of VQA models. Our algorithm has two main modules. Given a natural language question about an image, the first module takes the question as input and outputs basic questions related to the given main question, ranked by similarity score. The second module takes the main question, the image, and these basic questions as input and outputs the text-based answer to the main question about the given image. We claim that a robust VQA model is one whose performance does not change much when related basic questions are also made available to it as input. We formulate basic question generation as a LASSO optimization problem, and we propose a large-scale Basic Question Dataset (BQD) and Rscore (a novel robustness measure) for analyzing the robustness of VQA models. We hope our BQD will be used as a benchmark to evaluate the robustness of VQA models, so as to help the community build more robust and accurate VQA models.
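
To make the first module more concrete, the following is a minimal sketch of how ranking basic questions could be posed as a LASSO problem. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: each question is assumed to be represented by a fixed-length embedding vector, the candidate embeddings form the columns of a matrix A, the main question embedding is b, and the sparse weights x that minimize 0.5 * ||Ax - b||^2 + lam * ||x||_1 are treated as similarity scores. The solver (plain ISTA), the function names, and the parameter `lam` are hypothetical and are not taken from the paper.

```python
def soft_threshold(value, threshold):
    """Proximal operator of the L1 norm for a single coordinate."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_ista(A, b, lam, steps=500):
    """Minimise 0.5 * ||A x - b||^2 + lam * ||x||_1 with ISTA.

    A is a list of m rows, each with n entries; b has length m.
    """
    m, n = len(A), len(A[0])
    # The squared Frobenius norm upper-bounds the largest eigenvalue of
    # A^T A, so 1 / lipschitz is a safe (if conservative) step size.
    lipschitz = sum(a * a for row in A for a in row)
    x = [0.0] * n
    for _ in range(steps):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - b[i]
                    for i in range(m)]
        gradient = [sum(A[i][j] * residual[i] for i in range(m))
                    for j in range(n)]
        x = [soft_threshold(x[j] - gradient[j] / lipschitz, lam / lipschitz)
             for j in range(n)]
    return x


def rank_basic_questions(candidates, embeddings, main_vec, lam=0.05, top_k=3):
    """Return up to top_k (question, weight) pairs with non-zero weight.

    candidates: list of candidate basic questions (strings).
    embeddings: one embedding vector per candidate (assumed precomputed).
    main_vec:   embedding of the main question.
    """
    dim = len(main_vec)
    # Candidate embeddings become the columns of A.
    A = [[emb[i] for emb in embeddings] for i in range(dim)]
    weights = lasso_ista(A, main_vec, lam)
    ranked = sorted(zip(candidates, weights), key=lambda pair: -abs(pair[1]))
    return [(q, w) for q, w in ranked[:top_k] if w != 0.0]


if __name__ == "__main__":
    # Toy, hand-made embeddings; real ones would come from a sentence encoder.
    questions = ["What animal is this?", "Is it daytime?", "What sport is shown?"]
    vectors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    for question, weight in rank_basic_questions(questions, vectors, [1.0, 0.3, 0.0]):
        print(f"{weight:.3f}  {question}")
```

The L1 penalty drives the weights of unrelated candidates to exactly zero, which is why a LASSO-style objective yields a short ranked list rather than a dense score for every candidate. In practice a library solver would replace the hand-written loop.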
Bibliographic details

  • Author

    Huang Jia-Hong;

  • Affiliation
  • Year 2017
  • Total pages
  • Original format PDF
  • Language en
  • CLC classification