Overcoming language priors in VQA via adding visual module

Zhao Jia; Zhang Xuesong; Wang XuefengYang YingSun Gang

首页> 外文期刊>Neural computing & applications >Overcoming language priors in VQA via adding visual module

【24h】

Overcoming language priors in VQA via adding visual module

机译：Overcoming language priors in VQA via adding visual module

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Visual Question Answering (VQA) is a new and popular research direction. Dealing with language prior problems has become a hot topic in VQA in the past two years. With the development of technologies relating to VQA, related scholars have realized that in VQA tasks, the generation of answers relies too much on language priors and considers less with visual content. Some of the previous methods to alleviate language priors only focus on processing the question, while the methods to increase visual acuity only concentrate on finding the correct region. To better overcome the language prior problem of VQA, we propose a method that will improve visual content further to enhance the impact of visual content on answers. Our method consists of three parts: the base model branch, the question-only model branch, and the visual model branch. Many experiments have been carried out on the three datasets VQA-CP v1, VQA-CP v2, and VQA v2, which proves the effectiveness of our method and further improves the accuracy of the different models. Our code is available in GitHub (https://github.com/shonnon-zxs/AddingVisualModule).

著录项

来源
《Neural computing & applications》 |2022年第11期|9015-9023|共9页
作者
Zhao Jia; Zhang Xuesong; Wang XuefengYang YingSun Gang;
展开▼
作者单位

Fuyang Normal Univ;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类人工神经网络计算机;人工智能理论;
关键词
Visual Question Answering; Language prior; Visual branches; Visual importance;

Overcoming language priors in VQA via adding visual module

摘要

著录项

相关主题

期刊订阅