Answer-Type Prediction for Visual Question Answering

机译：视觉问题回答的答案类型预测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, algorithms for object recognition and related tasks have become sufficiently proficient that new vision tasks can now be pursued. In this paper, we build a system capable of answering open-ended text-based questions about images, which is known as Visual Question Answering (VQA). Our approach's key insight is that we can predict the form of the answer from the question. We formulate our solution in a Bayesian framework. When our approach is combined with a discriminative model, the combined model achieves state-of-the-art results on four benchmark datasets for open-ended VQA: DAQUAR, COCO-QA, The VQA Dataset, and Visual7W.

机译：近来，用于对象识别和相关任务的算法已经变得足够熟练，以使得现在可以追求新的视觉任务。在本文中，我们构建了一个能够回答有关图像的开放式基于文本的问题的系统，称为视觉问题解答（VQA）。我们的方法的主要见解是，我们可以根据问题预测答案的形式。我们在贝叶斯框架中制定解决方案。当我们的方法与判别模型结合使用时，结合的模型可以在开放式VQA的四个基准数据集上获得最新的结果：DAQUAR，COCO-QA，VQA数据集和Visual7W。

著录项

来源
《IEEE Conference on Computer Vision and Pattern Recognition》|2016年|4976-4984|共9页
会议地点
作者
Kushal Kafle; Christopher Kanan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Image color analysis; Visualization; Bayes methods; Knowledge discovery; Neural networks; Encoding; Semantics;

机译：图像色彩分析可视化贝叶斯方法知识发现神经网络编码语义;

相似文献

外文文献
中文文献
专利

1. Question-aware prediction with candidate answer recommendation for visual question answering [J] . B. Kim, J. Kim Electronics Letters . 2017,第18期

机译：带有候选答案推荐的问题感知预测，用于视觉问答
2. Visual potential expert prediction in question and answering communities [J] . Xiong Xiaoxiao, Fu Min, Zhu Min, Journal of Visual Languages & Computing . 2018,第OCTa期

机译：问答社区中潜在的视觉专家预测
3. Multiple answers to a question: a new approach for visual question answering [J] . Hosseinabad Sayedshayan Hashemi, Safayani Mehran, Mirzaei Abdolreza The Visual Computer . 2021,第1期

机译：问题的多个答案：一种新的视觉问题接听方法
4. Answer-Type Prediction for Visual Question Answering [C] . Kushal Kafle, Christopher Kanan IEEE Conference on Computer Vision and Pattern Recognition . 2016

机译：答案类型预测视觉问题应答
5. Attention Correction Mechanisms in Visual Contexts in Visual Question Answering [D] . Sharan, Komal 2018

机译：视觉问答中视觉上下文中的注意力纠正机制
6. Question Popularity Analysis and Prediction in Community Question Answering Services [O] . Ting Liu, Wei-Nan Zhang, Liujuan Cao, -1

机译：社区问答服务中的问题流行度分析与预测
7. Answer-Type Prediction for Visual Question Answering [O] . Kushal Kafle, Christopher Kanan 2016

机译：视觉问答的答案类型预测
8. Questions and Answers on Quality, the ISO 9000 Standard Series, Quality SystemRegistration, and Related Issues. More Questions and Answers on the ISO 9000 Standard Series and Related Issues [R] . Breitenberg, M. 1993

机译：有关质量的问题和解答，IsO 9000标准系列，质量体系注册和相关问题。有关IsO 9000标准系列及相关问题的更多问题和解答

Answer-Type Prediction for Visual Question Answering

摘要

著录项

相似文献

相关主题

期刊订阅