Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images

机译：学习含蓄地代表来自多尺度特征和多视图图像的3D人体

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reconstruction of 3D human bodies, from images, faces many challenges, due to it generally being an ill-posed problem. In this paper we present a method to reconstruct 3D human bodies from multi-view images, through learning an implicit function to represent 3D shape, based on multi-scale features extracted by multi-stage end-to-end neural networks. Our model consists of several end-to-end hourglass networks for extracting multi-scale features from multi-view images, and a fully connected network for implicit function classification from these features. Given a 3D point, it is projected to multi-view images and these images are fed into our model to extract multiscale features. The scales of features extracted by the hourglass networks decrease with the depth of our model, which represents the information from local to global scale. Then, the multi-scale features as well as the depth of the 3D point are combined to a new feature vector and the fully connected network classifies the feature vector, in order to predict if the point lies inside or outside of the 3D mesh. The advantage of our method is that we use both local and global features in the fully connected network and represent the 3D mesh by an implicit function, which is more memory-efficient. Experiments on public datasets demonstrate that our method surpasses previous approaches in terms of the accuracy of 3D reconstruction of human bodies from images.

机译：从图像中重建3D人体，面临许多挑战，这通常是一个弊端的问题。在本文中，我们介绍一种从多视图图像重建3D人体的方法，通过学习隐式功能来表示由多级端到端神经网络提取的多尺度特征来表示3D形状。我们的模型包括来自多视图图像的多尺度特征的多个端到端沙漏网络，以及来自这些功能的完全连接的网络。给定3D点，将其投影到多视图图像，并且这些图像被馈送到我们的模型中以提取多尺度特征。由沙漏网络提取的特征的尺度随着我们模型的深度而降低，这代表了来自本地到全局规模的信息。然后，将多尺度特征以及3D点的深度组合到新的特征向量，并且完全连接的网络对特征向量进行分类，以便预测点在3D网格的内部或外部。我们的方法的优点是我们在完全连接的网络中使用本地和全局特征，并通过隐式功能表示3D网格，这是更高的内容效率。公共数据集的实验表明，我们的方法在从图像中的3D重建的准确性方面超越了先前的方法。

著录项

来源
《International Conference on Pattern Recognition》|2021年|8968-8975|共8页
会议地点
作者
Zhongguo Li; Magnus Oskarsson; Anders Heyden;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Training; Solid modeling; Three-dimensional displays; Shape; Biological system modeling; Neural networks; Feature extraction;

机译：培训;实体建模;三维显示器;形状;生物系统建模;神经网络;特征提取;

相似文献

外文文献
中文文献
专利

1. Projective Feature Learning for 3D Shapes with Multi-View Depth Images [J] . Xie Zhige, Xu Kai, Shan Wen, Computer Graphics Forum: Journal of the European Association for Computer Graphics . 2015,第7期

机译：具有多视图深度图像的3D形状的投影特征学习
2. Real-time 3D human body posture estimation by Kalman filter from multi-view sequential images [J] . Kengo Terada, Atsushi Okuda, Minoru Nakazawa, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2001,第713期

机译：通过卡尔曼滤波器从多视图顺序图像实时估计3D人体姿势
3. Real-time 3D human body posture estimation by Kalman filter from multi-view sequential images [J] . Kengo Terada, Atsushi Okuda, Minoru Nakazawa, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2001,第713期

机译：基于多视图顺序图像的卡尔曼滤波器实时3D人体姿势估算
4. A Multi-view Deep Learning Approach for Detecting Threats on 3D Human Body [C] . Zhicong Yan, Shuai Feng, Fangqi Li, International conference on communications, signal processing, and systems . 2020

机译：一种检测3D人体威胁的多视图深度学习方法
5. Reconstructing and Optimizing Natural Images Perceived by the Human Brain Based on Bayesian Deep Multi-View Learning [D] . Li, Xintong. 2021

机译：基于贝叶斯深度多视图学习的人脑重建和优化自然图像
6. ALS Point Cloud Classification by Integrating an Improved Fully Convolutional Network into Transfer Learning with Multi-Scale and Multi-View Deep Features [O] . Xiangda Lei, Hongtao Wang, Cheng Wang, 2020

机译：ALS点云分类通过将改进的完全卷积网络集成到传输学习以多尺度和多视图深度特征
7. Learning Monocular 3D Human Pose Estimation from Multi-view Images [O] . Helge Rhodin, Frederic Meyer, Jorg Sporri, 2018

机译：从多视图图像学习单眼3D人类姿势估计
8. 3D Model-Based Tracking of Humans in Action: A Multi-View Approach [R] . Gavrila, D. M., Davis, L. S. 1995

机译：基于三维模型的人类行动追踪：一种多视图方法

Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images

摘要

著录项

相似文献

相关主题

期刊订阅