首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization
【24h】

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

机译:VidLoc:用于6自由度视频剪辑重新定位的深时空模型

获取原文

摘要

Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images. However, in most cases image-sequences, rather only single images, are readily available. To this extent, none of the proposed learning-based approaches exploit the valuable constraint of temporal smoothness, often leading to situations where the per-frame error is larger than the camera motion. In this paper we propose a recurrent model for performing 6-DoF localization of video-clips. We find that, even by considering only short sequences (20 frames), the pose estimates are smoothed and the localization error can be drastically reduced. Finally, we consider means of obtaining probabilistic pose estimates from our model. We evaluate our method on openly-available real-world autonomous driving and indoor localization datasets.
机译:机器学习技术,即卷积神经网络(CNN)和回归森林,最近在执行单眼图像的6自由度定位中显示出了巨大的希望。然而,在大多数情况下,图像序列很容易获得,而不仅仅是单个图像。在此程度上,所提出的基于学习的方法均未利用时间平滑性的宝贵约束,通常会导致每帧误差大于摄像头运动的情况。在本文中,我们提出了用于执行视频片段的6自由度定位的递归模型。我们发现,即使仅考虑短序列(20帧),姿态估计也会变得平滑,并且可以大大减少定位误差。最后,我们考虑从模型中获得概率姿态估计的方法。我们在公开可用的现实世界自动驾驶和室内定位数据集上评估我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号