ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models

机译：ObjectNet：一个大规模的偏置控制数据集，用于推动对象识别模型的限制

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We collect a large real-world test set, ObjectNet, for object recognition with controls where object backgrounds, rotations, and imaging viewpoints are random. Most scientific experiments have controls, confounds which are removed from the data, to ensure that subjects cannot perform a task by exploiting trivial correlations in the data. Historically, large machine learning and computer vision datasets have lacked such controls. This has resulted in models that must be fine-tuned for new datasets and perform better on datasets than in real-world applications. When tested on ObjectNet, object detectors show a 40-45% drop in performance, with respect to their performance on other benchmarks, due to the controls for biases. Controls make ObjectNet robust to fine-tuning showing only small performance increases. We develop a highly automated platform that enables gathering datasets with controls by crowdsourcing image capturing and annotation. ObjectNet is the same size as the ImageNet test set (50,000 images), and by design does not come paired with a training set in order to encourage generalization. The dataset is both easier than ImageNet - objects are largely centered and unoccluded - and harder, due to the controls. Although we focus on object recognition here, data with controls can be gathered at scale using automated tools throughout machine learning to generate datasets that exercise models in new ways thus providing valuable feedback to researchers. This work opens up new avenues for research in generalizable, robust, and more human-like computer vision and in creating datasets where results are predictive of real-world performance.

机译：我们收集一个大型真实测试集，ObjectNet，用于对象识别，控制对象背景，旋转和成像视点是随机的。大多数科学实验都有控制，从数据中删除的混淆，以确保受试者无法通过利用数据中的普通相关性来执行任务。从历史上看，大型机器学习和计算机视觉数据集缺乏这种控制。这导致模型必须为新数据集进行微调，并且在数据集中比在现实世界中更好。当在ObjectNet上进行测试时，由于偏差的控制，对象探测器在其他基准上的性能下显示了40-45％的性能下降。控制使ObjectNet强大到微调显示只有小的性能增加。我们开发了一个高度自动化的平台，可以通过众包图像捕获和注释将数据集收集与控件进行收集。 ObjectNet与ImageNet测试集（50,000张图像）的大小相同，并且通过设计不会与培训集配对，以便鼓励泛化。 DataSet既比想象成更容易 - 由于控件，对象大量居中和未被密封 - 更加困难。虽然我们专注于这里的对象识别，但是可以在整个机器学习中使用自动化工具以规模收集控制的数据，以生成以新方式锻炼模型的数据集，从而为研究人员提供有价值的反馈。这项工作开辟了概括，强大，更为人类的计算机视觉的研究以及创建数据集的研究，在那里结果是对现实世界表现的预测。

著录项

来源
《Conference on Neural Information Processing Systems》|2020年|p8761-9553|共11页
会议地点
作者
Andrei Barbu; David Mayo; Julian Alverio; William Luo; Christopher Wang; Dan Gutfreund; Joshua Tenenbaum; Boris Katz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计量学;
关键词

相似文献

外文文献
中文文献
专利

1. DVS Benchmark Datasets for Object Tracking, Action Recognition, and Object Recognition [J] . Yuhuang Hu, Hongjie Liu, Michael Pfeiffer, Frontiers in Neuroscience . 2016,第2016期

机译：用于对象跟踪，动作识别和对象识别的DVS基准数据集
2. Speed limit sign detection and recognition system using SVM and MNIST datasets [J] . Saadna Yassmina, Behloul Ali, Mezzoudj Saliha Neural computing & applications . 2019,第9期

机译：使用SVM和MNIST数据集的速度限制标志检测和识别系统
3. Performance evaluation of large-scale object recognition system using bag-of-visual words model [J] . Min-Uk Kim, Kyoungro Yoon Multimedia Tools and Applications . 2015,第7期

机译：基于视觉袋模型的大型目标识别系统性能评估
4. ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models [C] . Andrei Barbu, David Mayo, Julian Alverio, Conference on Neural Information Processing Systems . 2020

机译：ObjectNet：一个大规模的偏置控制数据集，用于推动对象识别模型的限制
5. Modeling economic and financial behavior from large-scale datasets. [D] . Mao, Huina. 2014

机译：从大规模数据集中对经济和金融行为进行建模。
6. DVS Benchmark Datasets for Object Tracking Action Recognition and Object Recognition [O] . Yuhuang Hu, Hongjie Liu, Michael Pfeiffer, 2016

机译：用于对象跟踪动作识别和对象识别的DVS基准数据集
7. A Large-Scale 3D Object Recognition dataset [O] . Sølund Thomas, Buch Anders Glent, Krüger Norbert, 2016

机译：大型3D对象识别数据集
8. Large-scale Benchmark Dataset for Event Recognition in Surveillance Video [R] . Oh, S., Hoogs, A., Perera, A., 2011

机译：用于监视视频中事件识别的大规模基准数据集

ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models

摘要

著录项

相似文献

相关主题

期刊订阅