首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >SUN RGB-D: A RGB-D scene understanding benchmark suite
【24h】

SUN RGB-D: A RGB-D scene understanding benchmark suite

机译:SUN RGB-D:RGB-D场景理解基准套件

获取原文

摘要

Although RGB-D sensors have enabled major break-throughs for several vision tasks, such as 3D reconstruction, we have not attained the same level of success in high-level scene understanding. Perhaps one of the main reasons is the lack of a large-scale benchmark with 3D annotations and 3D evaluation metrics. In this paper, we introduce an RGB-D benchmark suite for the goal of advancing the state-of-the-arts in all major scene understanding tasks. Our dataset is captured by four different sensors and contains 10,335 RGB-D images, at a similar scale as PASCAL VOC. The whole dataset is densely annotated and includes 146,617 2D polygons and 64,595 3D bounding boxes with accurate object orientations, as well as a 3D room layout and scene category for each image. This dataset enables us to train data-hungry algorithms for scene-understanding tasks, evaluate them using meaningful 3D metrics, avoid overfitting to a small testing set, and study cross-sensor bias.
机译:尽管RGB-D传感器已在一些视觉任务(例如3D重建)上实现了重大突破,但在高级场景理解方面我们仍未达到相同的成功水平。可能的主要原因之一是缺乏带有3D注释和3D评估指标的大规模基准测试。在本文中,我们引入了RGB-D基准套件,其目的是在所有主要场景理解任务中提高最新技术水平。我们的数据集由四个不同的传感器捕获,并包含10,335张RGB-D图像,其缩放比例与PASCAL VOC相似。整个数据集都进行了密集注释,其中包括146,617个2D多边形和64,595个3D边界框,它们具有精确的对象方向,以及每个图像的3D房间布局和场景类别。该数据集使我们能够训练需要大量数据的算法来了解场景的任务,使用有意义的3D指标对其进行评估,避免过小地适应较小的测试集,并研究跨传感器偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号