首页> 外文会议>IEEE International Conference on Artificial Intelligence and Virtual Reality >A Method to Build Multi-Scene Datasets for CNN for Camera Pose Regression
【24h】

A Method to Build Multi-Scene Datasets for CNN for Camera Pose Regression

机译:用于构建CNN的多场景数据集的方法,用于相机姿势回归

获取原文

摘要

Convolutional neural networks (CNN) have shown to be useful for camera pose regression, and They have robust effects against some challenging scenarios such as lighting changes, motion blur, and scenes with lots of textureless surfaces. Additionally, PoseNet shows that the deep learning system can interpolate the camera pose in space between training images. In this paper, we explore how different strategies for processing datasets will affect the pose regression and propose a method for building multi-scene datasets for training such neural networks. We demonstrate that the location of several scenes can be remembered using only one neural network. By combining multiple scenes, we found that the position errors of the neural network do not decrease significantly as the distance between the cameras increases, which means that we do not need to train several models for the increase number of scenes. We also explore the impact factors that influence the accuracy of models for multi-scene camera pose regression, which can help us merge several scenes into one dataset in a better way. We opened our code and datasets to the public for better researches.
机译:卷积神经网络(CNN)已显示对相机姿势回归有用,并且对某些具有挑战性的情景具有强大的影响,例如照明变化,运动模糊和具有许多Textullesfless的场景。此外,Poshet表明深度学习系统可以在训练图像之间插入相机姿势。在本文中,我们探讨了处理数据集的不同策略将如何影响姿势回归,并提出用于构建用于训练此类神经网络的多场景数据集的方法。我们证明可以仅使用一个神经网络来记住若干场景的位置。通过组合多个场景,我们发现神经网络的位置误差不会随着摄像机之间的距离而显着减小,这意味着我们不需要培训几个模型以增加场景数量。我们还探讨了影响多场景摄像机姿势回归模型准确性的影响因素,这可以帮助我们以更好的方式将若干场景合并到一个数据集中。我们向公众开设了我们的代码和数据集以获得更好的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号