首页> 外文会议>International Conference on Pattern Recognition >P ≈ NP, at least in Visual Question Answering
【24h】

P ≈ NP, at least in Visual Question Answering

机译:P≈NP,至少在视觉问题的回答中

获取原文

摘要

In recent years, progress in the Visual Question Answering (VQA) field has largely been driven by public challenges and large datasets. One of the most widely-used of these is the VQA 2.0 dataset, consisting of polar (“yes/no”) and non-polar questions. Looking at the question distribution over all answers, we find that the answers “yes” and “no” account for 38% of the questions (19% per class), while the remaining 62% are spread over the remaining 3127 answers (0.02% per class). While several sources of biases have been investigated in the field, the effects of such an over-representation of polar questions remain unclear. In this paper, we measure the potential confounding factors when polar and non-polar samples are used jointly to train a baseline VQA classifier, and compare it to an upper bound where the over-representation of polar questions is excluded from the training. Further, we perform cross-over experiments to analyze how well the feature spaces of polar and non-polar samples align. Contrary to expectations, we find no evidence of counterproductive effects in the joint training of unbalanced classes. In fact, by exploring the intermediate feature space of visual-text embeddings, we find that the feature space of polar questions already encodes sufficient structure to answer many non-polar questions. Our results indicate that the polar (P) and the non-polar (NP) feature spaces are strongly aligned, hence the expression P ≈ NP.
机译:近年来,视觉问题应答(VQA)领域的进展基本上受到公共挑战和大型数据集的推动。其中最广泛使用的是VQA 2.0数据集,由极性(“是/否”)和非极性问题组成。看着所有答案的问题分布,我们发现答案“是”和“否”占问题的38%(每班19%),而剩余的62%会在剩余的3127答案中传播(0.02%)每班)。虽然已经在现场研究了几个偏见来源,但是这种极地问题的效果仍然不清楚。在本文中,我们测量极性和非极性样本与培训基线VQA分类器的使用,并将其与培训中的过度表示的上限进行比较。此外,我们执行交叉实验,分析极性和非极性样本对齐的特征空间。与期望相反,我们没有发现在不平衡课程的联合培训方面没有对适当影响的证据。实际上,通过探索视觉文本嵌入的中间特征空间,我们发现极地问题的特征空间已经编码了足够的结构来回答许多非极性问题。我们的结果表明,极性(P)和非极性(NP)特征空间强烈对齐,因此表达式P≈NP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号