IEEE Winter Conference on Applications of Computer Vision

Improving 3D Human Pose Estimation Via 3D Part Affinity Fields



Abstract

3D human pose estimation from monocular images has recently become an active area in computer vision. For years, most deep-neural-network-based approaches have been either end-to-end or two-stage. An end-to-end network estimates 3D human poses directly from 2D input images, but it suffers from the scarcity of 3D human pose data, and it is hard to tell whether its errors stem from limited visual understanding or from the 2D-to-3D mapping. A two-stage approach, by contrast, first runs an existing network for 2D keypoint detection and then lifts the detected 2D keypoints directly to 3D space; however, it tends to ignore useful contextual cues in the raw 2D image pixels. In this paper, we introduce a two-stage architecture that eliminates the main disadvantages of both approaches. In the first stage we use an existing state-of-the-art detector to estimate 2D poses. To add contextual information that helps lift 2D poses to 3D poses, we propose 3D Part Affinity Fields (3D-PAFs). We use 3D-PAFs to infer 3D limb vectors and combine them with the 2D poses to regress the 3D coordinates. We trained and tested the proposed framework on Human3.6M, the most popular 3D human pose benchmark dataset. Our approach achieves state-of-the-art performance, which shows that with the right choice of contextual information, a simple regression model can be very effective at estimating 3D poses.
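The second stage described above (combining 2D keypoints with 3D-PAF-derived limb vectors as input to a regression model) can be sketched as follows. This is a minimal illustration only: the joint/limb counts, the feature layout, and the stand-in linear regressor are assumptions for demonstration, not the authors' actual implementation.

```python
import numpy as np

# Hypothetical Human3.6M-style skeleton: 17 joints, 16 limbs.
# All names and shapes here are illustrative, not the paper's code.
N_JOINTS, N_LIMBS = 17, 16

def lift_features(kp2d, limb_vecs3d):
    """Build the regression input: flattened 2D keypoints (N_JOINTS x 2)
    concatenated with 3D limb unit vectors (N_LIMBS x 3), the latter
    standing in for directions inferred from the 3D-PAFs."""
    return np.concatenate([kp2d.reshape(-1), limb_vecs3d.reshape(-1)])

rng = np.random.default_rng(0)

# Stand-in linear regressor; the paper learns this mapping from data.
W = rng.standard_normal((N_JOINTS * 3, N_JOINTS * 2 + N_LIMBS * 3))

kp2d = rng.standard_normal((N_JOINTS, 2))        # stage-1 2D detections
limb_vecs3d = rng.standard_normal((N_LIMBS, 3))  # from 3D-PAFs
limb_vecs3d /= np.linalg.norm(limb_vecs3d, axis=1, keepdims=True)

# Regress one 3D pose: (17*3,) output reshaped to (17, 3) coordinates.
pose3d = (W @ lift_features(kp2d, limb_vecs3d)).reshape(N_JOINTS, 3)
print(pose3d.shape)  # (17, 3)
```

The point of the sketch is the feature construction: the 2D coordinates carry joint locations, while the limb unit vectors supply the 3D directional context that a plain 2D-to-3D lifter lacks.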
