Visual Turing test for computer vision systems

Geman Donald; Geman Stuart; Hallonquist NeilYounes Laurent

首页> 外文期刊>Proceedings of the National Academy of Sciences of the United States of America >Visual Turing test for computer vision systems

【24h】

Visual Turing test for computer vision systems

机译：Visual Turing test for computer vision systems

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

Today, computer vision systems are tested by their accuracy in detecting and localizing instances of objects. As an alternative, and motivated by the ability of humans to provide far richer descriptions and even tell a story about an image, we construct a "visual Turing test": an operator-assisted device that produces a stochastic sequence of binary questions from a given test image. The query engine proposes a question; the operator either provides the correct answer or rejects the question as ambiguous; the engine proposes the next question ("just-in-time truthing"). The test is then administered to the computer-vision system, one question at a time. After the system's answer is recorded, the system is provided the correct answer and the next question. Parsing is trivial and deterministic; the system being tested requires no natural language processing. The query engine employs statistical constraints, learned from a training set, to produce questions with essentially unpredictable answers-the answer to a question, given the history of questions and their correct answers, is nearly equally likely to be positive or negative. In this sense, the test is only about vision. The system is designed to produce streams of questions that follow natural story lines, from the instantiation of a unique object, through an exploration of its properties, and on to its relationships with other uniquely instantiated objects.

著录项

来源
《Proceedings of the National Academy of Sciences of the United States of America》 |2015年第12期|3618-3623|共6页
作者
Geman Donald; Geman Stuart; Hallonquist NeilYounes Laurent;
展开▼
作者单位

Johns Hopkins Univ, Dept Appl Math & Stat, Baltimore, MD 21287 USA;

Brown Univ, Div Appl Math, Providence, RI 02912 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《生物学医学文摘》(MEDLINE);美国《化学文摘》(CA);
原文格式 PDF
正文语种英语
中图分类自然科学总论;
关键词
scene interpretation; computer vision; Turing test; binary questions; unpredictable answers;
入库时间 2024-01-29 17:15:53

Visual Turing test for computer vision systems

摘要

著录项

相关主题

期刊订阅