Semantically Guided Visual Question Answering



Abstract

We present a novel approach to enhance the challenging task of Visual Question Answering (VQA) by incorporating and enriching semantic knowledge in a VQA model. We first apply Multiple Instance Learning (MIL) to extract a richer visual representation addressing concepts beyond objects such as actions and colors. Motivated by the observation that semantically related answers often appear together in prediction, we further develop a new semantically-guided loss function for model learning which has the potential to drive weakly-scored but correct answers to the top while suppressing wrong answers. We show that these two ideas contribute to performance improvement in a complementary way. We demonstrate competitive results comparable to the state of the art on two VQA benchmark datasets.


Bibliographic record

Source
IEEE Conference on Applications of Computer Vision | 2018 | pp. 1377-2066 | 9 pages
Authors
Handong Zhao; Quanfu Fan; Dan Gutfreund; Yun Fu
Original format: PDF
Chinese Library Classification: TP391.41-53

Similar literature

1. R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering [J]. Pan Lu, Lei Ji, Wei Zhang. SIGKDD Explorations. 2018.
2. Visual Question Answering via Combining Inferential Attention and Semantic Space Mapping [J]. Liu Yun, Zhang Xiaoming, Huang Feiran. Knowledge-Based Systems. 2020.
3. SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions [J]. Sarrouti Mourad, Ouatik El Alaoui Said. Artificial Intelligence in Medicine. 2020.
4. Semantically Guided Visual Question Answering [C]. Handong Zhao, Quanfu Fan, Dan Gutfreund. IEEE Winter Conference on Applications of Computer Vision. 2018.
5. Long-answer question answering and rhetorical-semantic relations [D]. Blair-Goldensohn, Sasha J. 2007.
6. COVID-19 information retrieval with deep-learning based semantic search question answering and abstractive summarization [O]. Andre Esteva, Anuprit Kale, Romain Paulus. 2021.
7. Knowledge and Cross-Pair Pattern Guided Semantic Matching for Question Answering [O]. Zihan Xu, Hai-Tao Zheng, Shaopeng Zhai. 2020.
