IEEE International Conference on Multimedia and Expo
Twice Opportunity Knocks Syntactic Ambiguity: A Visual Question Answering Model with yes/no Feedback

Abstract

Visual Question Answering (VQA) is a joint vision-and-language task that aims to answer questions about given images. Syntactic ambiguity is common in human dialog, and it can also be found in the questions posed to VQA systems. Existing VQA methods generally adopt one-shot answering frameworks, which face great difficulty when a question is syntactically ambiguous. In human dialog, people often resolve such ambiguity by asking a question back for confirmation. Inspired by this observation, we propose a novel method that eliminates syntactic ambiguity in VQA via the user's yes/no feedback. We compared our method with existing methods on two benchmark datasets, CLEVR and CLEVR-CoGenT. Our method achieves accuracy close to 100% on CLEVR, and on CLEVR-CoGenT its accuracy is 21% higher than the state-of-the-art method.
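The idea in the abstract can be sketched as a simple control flow: answer in one shot when a question parses uniquely, and otherwise feed back a yes/no confirmation question to the user before committing to an answer. The sketch below is illustrative only; all function names (`parse_question`, `answer_with_feedback`) and the toy ambiguity trigger are assumptions, not the authors' actual model or API.

```python
# Hypothetical sketch of the yes/no-feedback loop described in the abstract.
# A real system would detect ambiguity with a syntactic parser and answer
# with a trained VQA model; both are stubbed out here.

def parse_question(question):
    """Return all candidate parses; more than one signals syntactic ambiguity."""
    # Toy trigger: a phrase like "left of" can attach to different objects,
    # e.g. "the cube left of the sphere" vs. a different attachment reading.
    if "left of" in question:
        return ["parse_attach_first", "parse_attach_second"]
    return ["parse_unique"]

def answer_with_feedback(question, vqa_answer, ask_user_yes_no):
    """Answer one-shot if unambiguous; otherwise confirm the parse first."""
    parses = parse_question(question)
    if len(parses) == 1:
        return vqa_answer(question, parses[0])
    # Ambiguous: knock twice - ask a yes/no question, then answer under
    # whichever reading the user confirms.
    if ask_user_yes_no(f"Do you mean: {parses[0]}?"):
        chosen = parses[0]
    else:
        chosen = parses[1]
    return vqa_answer(question, chosen)
```

For example, with a stub model that just echoes the chosen parse, `answer_with_feedback("What is left of the sphere?", lambda q, p: p, lambda prompt: False)` returns the second reading, because the simulated user rejected the first one.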
