VizWiz Grand Challenge: Answering Visual Questions from Blind People

机译：VizWiz大挑战：回答盲人的视觉问题

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The study of algorithms to automatically answer visual questions currently is motivated by visual question answering (VQA) datasets constructed in artificial VQA settings. We propose VizWiz, the first goal-oriented VQA dataset arising from a natural VQA setting. VizWiz consists of over 31,000 visual questions originating from blind people who each took a picture using a mobile phone and recorded a spoken question about it, together with 10 crowdsourced answers per visual question. VizWiz differs from the many existing VQA datasets because (1) images are captured by blind photographers and so are often poor quality, (2) questions are spoken and so are more conversational, and (3) often visual questions cannot be answered. Evaluation of modern algorithms for answering visual questions and deciding if a visual question is answerable reveals that VizWiz is a challenging dataset. We introduce this dataset to encourage a larger community to develop more generalized algorithms that can assist blind people.

机译：当前，在人工VQA设置中构建的视觉问题解答（VQA）数据集推动了对自动回答视觉问题的算法的研究。我们提出VizWiz，这是自然VQA设置产生的第一个面向目标的VQA数据集。 VizWiz包含来自盲人的31,000多个视觉问题，每个盲人都使用手机拍照并记录了口头问题，每个视觉问题有10个众包回答。 VizWiz与许多现有的VQA数据集有所不同，因为（1）图像是由盲人摄影师捕获的，因此通常质量较差；（2）说出了问题，因此交谈性更高；（3）经常无法回答视觉问题。对用于回答视觉问题并确定视觉问题是否可回答的现代算法的评估表明，VizWiz是一个具有挑战性的数据集。我们引入此数据集以鼓励更大的社区开发可以帮助盲人的更通用的算法。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2018年|3608-3617|共10页
会议地点 Salt Lake City(US)
作者
Danna Gurari; Qing Li; Abigale J. Stangl; Anhong Guo; Chi Lin; Kristen Grauman; Jiebo Luo; Jeffrey P. Bigham;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Visualization; Blindness; Prediction algorithms; Lighting; Computer vision; Mobile handsets; Shape;

机译：可视化；失明;预测算法；灯光;计算机视觉;手机；形状;

相似文献

外文文献
中文文献
专利

1. Visual question answering: Datasets, algorithms, and future challenges [J] . Kushal Kafle, Christopher Kanan Computer vision and image understanding . 2017,第octa期

机译：视觉问题解答：数据集，算法和未来挑战
2. Multiple answers to a question: a new approach for visual question answering [J] . Hosseinabad Sayedshayan Hashemi, Safayani Mehran, Mirzaei Abdolreza The Visual Computer . 2021,第1期

机译：问题的多个答案：一种新的视觉问题接听方法
3. Question-aware prediction with candidate answer recommendation for visual question answering [J] . B. Kim, J. Kim Electronics Letters . 2017,第18期

机译：带有候选答案推荐的问题感知预测，用于视觉问答
4. VizWiz Grand Challenge: Answering Visual Questions from Blind People [C] . Danna Gurari, Qing Li, Abigale J. Stangl, IEEE/CVF Conference on Computer Vision and Pattern Recognition . 2018

机译：Vizwiz Grand Challenge：回答盲人的视觉问题
5. Social Microvolunteering: Quick, Free Answers to Visual Questions from Blind People [D] . Brady, Erin 2015

机译：社会微志愿者：盲人视觉问题的快速，免费答案
6. Answers to the Health Questions People Ask in Libraries [O] . Shari Clifton 2009

机译：人们在图书馆提出的健康问题的答案
7. Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge [O] . Damien Teney, Peter Anderson, Xiaodong He, 2018

机译：视觉问题的提示和技巧回答：2017年挑战的学习

VizWiz Grand Challenge: Answering Visual Questions from Blind People

摘要

著录项

相似文献

相关主题

期刊订阅