首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >SUN RGB-D: A RGB-D scene understanding benchmark suite
【24h】

SUN RGB-D: A RGB-D scene understanding benchmark suite

机译:Sun RGB-D:一个RGB-D场景了解基准套件

获取原文
获取外文期刊封面目录资料

摘要

Although RGB-D sensors have enabled major break-throughs for several vision tasks, such as 3D reconstruction, we have not attained the same level of success in high-level scene understanding. Perhaps one of the main reasons is the lack of a large-scale benchmark with 3D annotations and 3D evaluation metrics. In this paper, we introduce an RGB-D benchmark suite for the goal of advancing the state-of-the-arts in all major scene understanding tasks. Our dataset is captured by four different sensors and contains 10,335 RGB-D images, at a similar scale as PASCAL VOC. The whole dataset is densely annotated and includes 146,617 2D polygons and 64,595 3D bounding boxes with accurate object orientations, as well as a 3D room layout and scene category for each image. This dataset enables us to train data-hungry algorithms for scene-understanding tasks, evaluate them using meaningful 3D metrics, avoid overfitting to a small testing set, and study cross-sensor bias.
机译:虽然RGB-D传感器已经启用了几种视觉任务的主要断路器,例如3D重建,但我们在高级场景理解中没有达到同样的成功水平。也许是主要原因之一是缺乏带3D注释和3D评估指标的大规模基准。在本文中,我们介绍了RGB-D基准套件,以实现所有主要场景理解任务的最先进的目标。我们的数据集由四种不同的传感器捕获,并包含10,335 RGB-D图像,以类似的刻度为Pascal VOC。整个数据集是密集的注释,包括146,617个2D多边形和64,595个3D边界框,具有精确的对象方向,以及每个图像的3D房间布局和场景类别。此数据集使我们能够为现场了解任务培训数据饥饿的算法,使用有意义的3D指标评估它们,避免对小型测试集进行过度接种,并研究交叉传感器偏置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号