Semantically Guided Visual Question Answering

Abstract

We present a novel approach to enhance the challenging task of Visual Question Answering (VQA) by incorporating and enriching semantic knowledge in a VQA model. We first apply Multiple Instance Learning (MIL) to extract a richer visual representation addressing concepts beyond objects such as actions and colors. Motivated by the observation that semantically related answers often appear together in prediction, we further develop a new semantically-guided loss function for model learning which has the potential to drive weakly-scored but correct answers to the top while suppressing wrong answers. We show that these two ideas contribute to performance improvement in a complementary way. We demonstrate competitive results comparable to the state of the art on two VQA benchmark datasets.

Bibliographic record

Source
IEEE Winter Conference on Applications of Computer Vision | 2018 | pp. 1852-1860 | 9 pages
Authors
Handong Zhao; Quanfu Fan; Dan Gutfreund; Yun Fu
Original format
PDF
Keywords
Visualization; Feature extraction; Automobiles; Semantics; Predictive models; Computational modeling; Task analysis

Similar documents

1. R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering [J]. Pan Lu, Lei Ji, Wei Zhang, SIGKDD Explorations. 2018
2. Visual Question Answering via Combining Inferential Attention and Semantic Space Mapping [J]. Liu Yun, Zhang Xiaoming, Huang Feiran, Knowledge-Based Systems. 2020
3. SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions [J]. Sarrouti Mourad, Ouatik El Alaoui Said, Artificial Intelligence in Medicine. 2020
4. Semantically Guided Visual Question Answering [C]. Handong Zhao, Quanfu Fan, Dan Gutfreund, IEEE Winter Conference on Applications of Computer Vision. 2018
5. Long-answer question answering and rhetorical-semantic relations [D]. Blair-Goldensohn, Sasha J. 2007
6. COVID-19 information retrieval with deep-learning based semantic search question answering and abstractive summarization [O]. Andre Esteva, Anuprit Kale, Romain Paulus, 2021
7. Knowledge and Cross-Pair Pattern Guided Semantic Matching for Question Answering [O]. Zihan Xu, Hai-Tao Zheng, Shaopeng Zhai, 2020
