首页> 外文会议>Conference on Neural Information Processing Systems >ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models
【24h】

ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models

机译:ObjectNet:一个大规模的偏置控制数据集,用于推动对象识别模型的限制

获取原文

摘要

We collect a large real-world test set, ObjectNet, for object recognition with controls where object backgrounds, rotations, and imaging viewpoints are random. Most scientific experiments have controls, confounds which are removed from the data, to ensure that subjects cannot perform a task by exploiting trivial correlations in the data. Historically, large machine learning and computer vision datasets have lacked such controls. This has resulted in models that must be fine-tuned for new datasets and perform better on datasets than in real-world applications. When tested on ObjectNet, object detectors show a 40-45% drop in performance, with respect to their performance on other benchmarks, due to the controls for biases. Controls make ObjectNet robust to fine-tuning showing only small performance increases. We develop a highly automated platform that enables gathering datasets with controls by crowdsourcing image capturing and annotation. ObjectNet is the same size as the ImageNet test set (50,000 images), and by design does not come paired with a training set in order to encourage generalization. The dataset is both easier than ImageNet - objects are largely centered and unoccluded - and harder, due to the controls. Although we focus on object recognition here, data with controls can be gathered at scale using automated tools throughout machine learning to generate datasets that exercise models in new ways thus providing valuable feedback to researchers. This work opens up new avenues for research in generalizable, robust, and more human-like computer vision and in creating datasets where results are predictive of real-world performance.
机译:我们收集一个大型真实测试集,ObjectNet,用于对象识别,控制对象背景,旋转和成像视点是随机的。大多数科学实验都有控制,从数据中删除的混淆,以确保受试者无法通过利用数据中的普通相关性来执行任务。从历史上看,大型机器学习和计算机视觉数据集缺乏这种控制。这导致模型必须为新数据集进行微调,并且在数据集中比在现实世界中更好。当在ObjectNet上进行测试时,由于偏差的控制,对象探测器在其他基准上的性能下显示了40-45%的性能下降。控制使ObjectNet强大到微调显示只有小的性能增加。我们开发了一个高度自动化的平台,可以通过众包图像捕获和注释将数据集收集与控件进行收集。 ObjectNet与ImageNet测试集(50,000张图像)的大小相同,并且通过设计不会与培训集配对,以便鼓励泛化。 DataSet既比想象成更容易 - 由于控件,对象大量居中和未被密封 - 更加困难。虽然我们专注于这里的对象识别,但是可以在整个机器学习中使用自动化工具以规模收集控制的数据,以生成以新方式锻炼模型的数据集,从而为研究人员提供有价值的反馈。这项工作开辟了概括,强大,更为人类的计算机视觉的研究以及创建数据集的研究,在那里结果是对现实世界表现的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号