Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

Chen Yu; Shen Chunhua; Chen Hao; Wei Xiu-Shen; Liu Lingqiao; Yang Jian

首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

【24h】

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

机译：对地标本地化结构知识的完全卷积网络的对抗学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Landmark/pose estimation in single monocular images has received much effort in computer vision due to its important applications. It remains a challenging task when input images come with severe occlusions caused by, e.g., adverse camera views. Under such circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity. To address the problem, by incorporating priors about the structure of pose components, we propose a novel structure-aware fully convolutional network to implicitly take such priors into account during training of the deep network. Explicit learning of such constraints is typically challenging. Instead, inspired by how human identifies implausible poses, we design discriminators to distinguish the real poses from the fake ones (such as biologically implausible ones). If the pose generator G generates results that the discriminator fails to distinguish from real ones, the network successfully learns the priors. Training of the network follows the strategy of conditional Generative Adversarial Networks (GANs). The effectiveness of the proposed network is evaluated on three pose-related tasks: 2D human pose estimation, 2D facial landmark estimation and 3D human pose estimation. The proposed approach significantly outperforms several state-of-the-art methods and almost always generates plausible pose predictions, demonstrating the usefulness of implicit learning of structures using GANs.

机译：由于其重要的应用，单眼图像中的地标/姿态估计在计算机视觉中获得了很大的努力。当输入图像具有严重的闭塞时，它仍然是一个具有挑战性的任务，例如，造成的，例如，逆势相机视图。在这种情况下，可以产生生物学上难以置信的姿态预测。相比之下，人类的视觉能够通过利用地标点相互连接的几何约束来预测姿势。为了解决问题，通过将前沿纳入姿势组件的结构，我们提出了一种新颖的结构感知完全卷积网络，以在深网络培训期间隐含地考虑这样的前瞻。明确学习这种约束通常是具有挑战性的。相反，灵感来自人类如何识别令人难以置信的姿势，我们设计鉴别者以区分真实的姿势（如生物学上难以置信的问题）。如果姿势生成器G生成结果，则鉴别器无法与真实的结果区分，网络成功地学习了前提。网络培训遵循条件生成对冲网络（GANS）的策略。建议网络的有效性在三个姿势相关的任务中评估：2D人类姿势估计，2D面部地标估计和3D人类姿态估计。该提出的方法显着优于几种最先进的方法，并且几乎总是产生合理的姿态预测，展示了使用GANS隐含结构的有用性。

著录项

来源
《IEEE Transactions on Pattern Analysis and Machine Intelligence》 |2020年第7期|1654-1669|共16页
作者
Chen Yu; Shen Chunhua; Chen Hao; Wei Xiu-Shen; Liu Lingqiao; Yang Jian;
展开▼
作者单位

Nanjing Univ Sci & Technol Jiangsu Key Lab Image & Video Understanding Socia Minist Educ Key Lab Intelligent Percept & Syst High Dimens In Nanjing 210094 Jiangsu Peoples R China;

Univ Adelaide Sch Comp Sci Adelaide SA 5005 Australia|Australian Ctr Robot Vis Brisbane Qld Australia;

Univ Adelaide Sch Comp Sci Adelaide SA 5005 Australia|Australian Ctr Robot Vis Brisbane Qld Australia;

Megvii Technol Megvii Res Nanjing Nanjing 210000 Jiangsu Peoples R China;

Univ Adelaide Sch Comp Sci Adelaide SA 5005 Australia|Australian Ctr Robot Vis Brisbane Qld Australia;

Nanjing Univ Sci & Technol Jiangsu Key Lab Image & Video Understanding Socia Minist Educ Key Lab Intelligent Percept & Syst High Dimens In Nanjing 210094 Jiangsu Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Pose estimation; Two dimensional displays; Three-dimensional displays; Heating systems; Task analysis; Training; Pose estimation; landmark localization; structure-aware network; adversarial training; multi-task learning; deep convolutional networks;

机译：姿态估计;二维显示器;三维显示器;加热系统;任务分析;培训;姿势估计;地标本地化;结构感知网络;对抗训练;多任务学习;深度卷积网络;
入库时间 2022-08-18 20:57:28

相似文献

外文文献
中文文献
专利

1. Learning Localized Representations of Point Clouds With Graph-Convolutional Generative Adversarial Networks [J] . Diego Valsesia, Giulia Fracastoro, Enrico Magli Multimedia, IEEE Transactions on . 2021,第1期

机译：用图形卷积生成的对抗网络学习点云的本地化表示
2. Copy-Move Forgery Detection and Localization Using a Generative Adversarial Network and Convolutional Neural-Network [J] . Younis Abdalla, M. Tariq Iqbal, Mohamed Shehata Information . 2019,第9期

机译：使用生成对抗网络和卷积神经网络进行复制移动伪造检测和定位
3. Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks [J] . Gasper Begus Computer speech and language . 2022,第Jana期

机译：深度卷积生成对冲网络，局部和非本地依赖学习和语音数据中的规则样式的出现
4. Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation [C] . Yu Chen, Chunhua Shen, Xiu-Shen Wei, IEEE International Conference on Computer Vision . 2017

机译：对抗人类：一种用于人类姿势估计的结构感知卷积网络
5. Internal and External Feature Engineering Applied to Deep Learning with Convolutional Neural Networks for Monocular Relative Pose Estimation in Visual Odometry and Self-Localization [D] . Parkins, Franz Payton. 2020

机译：内部和外部特征工程应用于卷积神经网络的深度学习，用于视觉测量和自定位中的单眼相对姿态估计
6. Structure-aware protein solubility prediction from sequence through graph convolutional network and predicted contact map [O] . Jianwen Chen, Shuangjia Zheng, Huiying Zhao, 2021

机译：通过图形卷积网络和预测联系地图的序列的结构感知蛋白质溶解度预测
7. Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization [O] . Yu Chen, Chunhua Shen, Hao Chen, 2020

机译：对地标本地化结构知识的完全卷积网络的对抗学习

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

摘要

著录项

相似文献

相关主题

期刊订阅