International Conference on 3D Immersion

VST3D-Net: Video-Based Spatio-Temporal Network for 3D Shape Reconstruction from a Video



Abstract

In this paper, we propose the Video-based Spatio-Temporal 3D Network (VST3D-Net), a novel learning approach for viewpoint-invariant 3D shape reconstruction from monocular video. In our VST3D-Net, a spatial feature extraction subnetwork is designed to encode the local and global spatial relationships of the object in the image. The extracted latent spatial features implicitly embed both shape and pose information. Although a single view can also be used to recover a 3D shape, richer shape information about the dynamic object can be explored and leveraged from video frames. To generate the viewpoint-free 3D shape, we design a temporal correlation feature extractor, which simultaneously handles the temporal consistency of the shape and pose of the moving object. Therefore, both the canonical 3D shape and the corresponding pose in each frame are recovered by the network. We validate our approach on a ShapeNet-based video dataset and the ApolloCar3D dataset. The experimental results show that the proposed VST3D-Net outperforms state-of-the-art approaches in both accuracy and efficiency.
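
To make the two-stage design described above concrete, the following is a minimal PyTorch sketch of the pipeline the abstract outlines: a per-frame spatial feature encoder, a temporal correlation module over the frame features, and two heads that output a canonical (viewpoint-free) shape plus a per-frame pose. All module choices, layer sizes, and output parameterizations (a GRU for temporal aggregation, a 1024-point cloud, a 6-DoF pose vector) are illustrative assumptions, not the architecture actually used in the paper.

```python
import torch
import torch.nn as nn


class VST3DNetSketch(nn.Module):
    """Hypothetical sketch of the two-stage design described in the abstract:
    a per-frame spatial encoder, a temporal correlation module, and separate
    heads for a canonical shape and per-frame poses. Layer sizes are
    illustrative assumptions, not the paper's values."""

    def __init__(self, feat_dim=256, num_points=1024):
        super().__init__()
        # Spatial feature extraction subnetwork: encodes local/global
        # relationships of the object in each frame into a latent vector.
        self.spatial_encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, feat_dim),
        )
        # Temporal correlation feature extractor: aggregates per-frame
        # latent features across the video (a GRU stands in here).
        self.temporal = nn.GRU(feat_dim, feat_dim, batch_first=True)
        # Canonical (viewpoint-free) shape head: one point cloud per clip.
        self.shape_head = nn.Linear(feat_dim, num_points * 3)
        # Per-frame pose head: axis-angle rotation + translation (6 DoF).
        self.pose_head = nn.Linear(feat_dim, 6)

    def forward(self, frames):
        # frames: (B, T, 3, H, W) video clip.
        b, t = frames.shape[:2]
        feats = self.spatial_encoder(frames.flatten(0, 1)).view(b, t, -1)
        seq, last = self.temporal(feats)                   # seq: (B, T, D)
        shape = self.shape_head(last[-1]).view(b, -1, 3)   # canonical shape
        poses = self.pose_head(seq)                        # (B, T, 6) poses
        return shape, poses


if __name__ == "__main__":
    net = VST3DNetSketch()
    clip = torch.randn(2, 5, 3, 64, 64)   # toy batch of 5-frame clips
    shape, poses = net(clip)
    print(shape.shape, poses.shape)       # (2, 1024, 3) and (2, 5, 6)
```

In this sketch, the shape is predicted once per clip from the final temporal state, while the pose head reads every time step, mirroring the abstract's split between a single canonical shape and a pose recovered for each frame.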
