VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

Abstract

Machine learning techniques, namely convolutional neural networks (CNNs) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images. However, in most cases image sequences, rather than only single images, are readily available. Despite this, none of the proposed learning-based approaches exploits the valuable constraint of temporal smoothness, often leading to situations where the per-frame error is larger than the camera motion. In this paper we propose a recurrent model for performing 6-DoF localization of video clips. We find that, even by considering only short sequences (20 frames), the pose estimates are smoothed and the localization error can be drastically reduced. Finally, we consider means of obtaining probabilistic pose estimates from our model. We evaluate our method on openly available real-world autonomous driving and indoor localization datasets.
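
The point about temporal smoothness can be illustrated with a toy sketch. The code below is not the paper's recurrent model: it swaps in a plain centered moving average over a synthetic 1-D trajectory, purely to show why combining neighbouring frames helps when the per-frame error exceeds the camera motion. The step size, the error magnitude, the alternating error pattern, and all function names are assumptions made for this example; only the 20-frame clip length comes from the abstract.

```python
# Toy illustration only: synthetic 1-D data, not the paper's model or datasets.
# A centered moving average stands in for learned temporal smoothing.

def moving_average(values, radius=1):
    """Centered moving average; the window is truncated at the clip boundaries."""
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - radius)
        hi = min(len(values), i + radius + 1)
        window = values[lo:hi]
        smoothed.append(sum(window) / len(window))
    return smoothed


def mean_abs_error(estimates, truth):
    """Mean absolute difference between estimated and true positions."""
    return sum(abs(e - t) for e, t in zip(estimates, truth)) / len(truth)


CLIP_LENGTH = 20  # clip length mentioned in the abstract
STEP = 0.1        # assumed camera motion per frame (arbitrary units)
NOISE = 0.5       # assumed per-frame error, deliberately larger than STEP

# Ground-truth positions: the camera moves at constant speed along one axis.
truth = [STEP * t for t in range(CLIP_LENGTH)]

# Deterministic alternating error, a stand-in for independent per-frame noise.
per_frame = [x + (NOISE if t % 2 == 0 else -NOISE) for t, x in enumerate(truth)]

smoothed = moving_average(per_frame, radius=1)

raw_error = mean_abs_error(per_frame, truth)
smoothed_error = mean_abs_error(smoothed, truth)
print(f"per-frame error: {raw_error:.3f}, smoothed error: {smoothed_error:.3f}")
```

In this synthetic setting the smoothed estimates are closer to the true trajectory than the independent per-frame estimates. A learned recurrent model would be expected to exploit temporal context in a more flexible way than a fixed averaging window.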

Bibliographic details

Source
IEEE Conference on Computer Vision and Pattern Recognition | 2017 | pp. 2652-2660 | 9 pages
Authors
Ronald Clark; Sen Wang; Andrew Markham; Niki Trigoni; Hongkai Wen
Original format: PDF
Keywords
Computational modeling; Cameras; Hidden Markov models; Feature extraction; Computer architecture; Three-dimensional displays; Computer vision;

Similar literature

1. DeepDSAIR: Deep 6-DOF camera relocalization using deblurred semantic-aware image representation for large-scale outdoor environments [J]. Esfahani Mandi Abolfazli, Wu Keyu, Yuan Shenghai. Image and Vision Computing. 2019, Sep. issue
2. Comparison of Deep Neural Networks and Deep Hierarchical Models for Spatio-Temporal Data [J]. Wikle Christopher K. Journal of Agricultural, Biological, and Environmental Statistics. 2019, No. 2
3. Indoor LiDAR Relocalization Based on Deep Learning Using a 3D Model [J]. H. Zhao, D. Acharya, M. Tomko. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2020, No. 4
4. VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization [C]. Ronald Clark, Sen Wang, Andrew Markham. IEEE Conference on Computer Vision and Pattern Recognition. 2017
5. Deep Learning Models for Spatio-temporal Forecasting and Analysis [D]. Asadi, Reza. 2020
6. Spatio-Temporal Abnormal Behavior Prediction in Elderly Persons Using Deep Learning Models [O]. Meriem Zerkouk, Belkacem Chikhaoui. 2020
7. VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization [O]. Clark, R, Wang, S, Markham, A. 2017
