Vision-as-Inverse-Graphics: Obtaining a Rich 3D Explanation of a Scene from a Single Image

机译：逆像视觉：从单个图像获取场景的丰富3D解释

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We develop an inverse graphics approach to the problem of scene understanding, obtaining a rich representation that includes descriptions of the objects in the scene and their spatial layout, as well as global latent variables like the camera parameters and lighting. The framework's stages include object detection, the prediction of the camera and lighting variables, and prediction of object-specific variables (shape, appearance and pose). This acts like the encoder of an autoencoder, with graphics rendering as the decoder Importantly the scene representation is interpretable and is of variable dimension to match the detected number of objects plus the global variables. For the prediction of the camera latent variables we introduce a novel architecture termed Probabilistic HoughNets (PHNs), which provides a principled approach to combining information from multiple detections. We demonstrate the quality of the reconstructions obtained quantitatively on synthetic data, and qualitatively on real scenes.

机译：我们开发了一种逆向图形方法来解决场景理解问题，获得了丰富的表示形式，其中包括场景中对象的描述及其空间布局，以及像摄像机参数和照明这样的全局潜在变量。框架的阶段包括对象检测，相机和照明变量的预测以及特定于对象的变量（形状，外观和姿势）的预测。这就像自动编码器的编码器一样，以图形渲染作为解码器。重要的是，场景表示形式是可解释的，并且具有可变的维度，以匹配检测到的对象数和全局变量。为了预测摄像机的潜在变量，我们引入了一种称为概率HoughNets（PHN）的新颖体系结构，该体系结构提供了一种原理方法来组合来自多个检测的信息。我们证明了在合成数据上定量获得的重建质量，以及在真实场景上定性获得的重建质量。

著录项

来源
《IEEE International Conference on Computer Vision Workshops》|2017年|940-948|共9页
会议地点
作者
Lukasz Romaszko; Christopher K. I. Williams; Pol Moreno; Pushmeet Kohli;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cameras; Probabilistic logic; Lighting; Detectors; Object detection; Graphics; Transforms;

机译：相机;概率逻辑;照明;探测器;目标检测;图形;变换;
入库时间 2022-08-26 14:32:28

相似文献

外文文献
中文文献
专利

1. Obtaining pseudo-3D information from single-plane X-ray imaging [J] . J. Hrdy, P.Oberta Nuclear Instruments & Methods in Physics Research . 2012,第期

机译：从单平面X射线成像获取伪3D信息
2. 3D Scene Reconstruction with Sparse LiDAR Data and Monocular Image in Single Frame [J] . Yuanxin Zhong, Sijia Wang, Shichao Xie, SAE International Journal of Passenger Cars - Electronic and Electrical Systems . 2018,第1期

机译：3d与稀疏的激光雷达数据和单目象的场面重建在单个框架中
3. Single image-based 3D scene estimation from semantic prior [J] . Hwang Hyeong Jae, Yoon Sang Min Electronics Letters . 2015,第22期

机译：基于语义先验的基于单个图像的3D场景估计
4. Vision-as-Inverse-Graphics: Obtaining a Rich 3D Explanation of a Scene from a Single Image [C] . Lukasz Romaszko, Christopher K. I. Williams, Pol Moreno, IEEE International Conference on Computer Vision Workshops . 2017

机译：视觉和逆图：从单个图像获取富有的3D解释场景
5. Learning Single-view 3D Reconstruction of Objects and Scenes [D] . Tulsiani, Shubham. 2018

机译：学习对象和场景的单视图3D重建
6. 3D MR Neurography of the Lumbosacral Plexus: Obtaining Optimal Images for Selective Longitudinal Nerve Depiction [O] . G. Cho Sims, E. Boothe, R. Joodi, 2016

机译：3D腰骶神经丛的MR神经统治法：获得选择性纵向神经描绘的最佳图像
7. Make3D: Learning 3D Scene Structure from a Single Still Image [O] . Ashutosh Saxena, Min Sun, Andrew Y. Ng 2009

机译：Make3D：从单个静止图像学习3D场景结构

Vision-as-Inverse-Graphics: Obtaining a Rich 3D Explanation of a Scene from a Single Image

摘要

著录项

相似文献

相关主题

期刊订阅