首页> 外文会议>International Conference on Pattern Recognition >Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images
【24h】

Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images

机译:学习含蓄地代表来自多尺度特征和多视图图像的3D人体

获取原文

摘要

Reconstruction of 3D human bodies, from images, faces many challenges, due to it generally being an ill-posed problem. In this paper we present a method to reconstruct 3D human bodies from multi-view images, through learning an implicit function to represent 3D shape, based on multi-scale features extracted by multi-stage end-to-end neural networks. Our model consists of several end-to-end hourglass networks for extracting multi-scale features from multi-view images, and a fully connected network for implicit function classification from these features. Given a 3D point, it is projected to multi-view images and these images are fed into our model to extract multiscale features. The scales of features extracted by the hourglass networks decrease with the depth of our model, which represents the information from local to global scale. Then, the multi-scale features as well as the depth of the 3D point are combined to a new feature vector and the fully connected network classifies the feature vector, in order to predict if the point lies inside or outside of the 3D mesh. The advantage of our method is that we use both local and global features in the fully connected network and represent the 3D mesh by an implicit function, which is more memory-efficient. Experiments on public datasets demonstrate that our method surpasses previous approaches in terms of the accuracy of 3D reconstruction of human bodies from images.
机译:从图像中重建3D人体,面临许多挑战,这通常是一个弊端的问题。在本文中,我们介绍一种从多视图图像重建3D人体的方法,通过学习隐式功能来表示由多级端到端神经网络提取的多尺度特征来表示3D形状。我们的模型包括来自多视图图像的多尺度特征的多个端到端沙漏网络,以及来自这些功能的完全连接的网络。给定3D点,将其投影到多视图图像,并且这些图像被馈送到我们的模型中以提取多尺度特征。由沙漏网络提取的特征的尺度随着我们模型的深度而降低,这代表了来自本地到全局规模的信息。然后,将多尺度特征以及3D点的深度组合到新的特征向量,并且完全连接的网络对特征向量进行分类,以便预测点在3D网格的内部或外部。我们的方法的优点是我们在完全连接的网络中使用本地和全局特征,并通过隐式功能表示3D网格,这是更高的内容效率。公共数据集的实验表明,我们的方法在从图像中的3D重建的准确性方面超越了先前的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号