首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization
【24h】

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

机译:对地标本地化结构知识的完全卷积网络的对抗学习

获取原文
获取原文并翻译 | 示例
       

摘要

Landmark/pose estimation in single monocular images has received much effort in computer vision due to its important applications. It remains a challenging task when input images come with severe occlusions caused by, e.g., adverse camera views. Under such circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity. To address the problem, by incorporating priors about the structure of pose components, we propose a novel structure-aware fully convolutional network to implicitly take such priors into account during training of the deep network. Explicit learning of such constraints is typically challenging. Instead, inspired by how human identifies implausible poses, we design discriminators to distinguish the real poses from the fake ones (such as biologically implausible ones). If the pose generator G generates results that the discriminator fails to distinguish from real ones, the network successfully learns the priors. Training of the network follows the strategy of conditional Generative Adversarial Networks (GANs). The effectiveness of the proposed network is evaluated on three pose-related tasks: 2D human pose estimation, 2D facial landmark estimation and 3D human pose estimation. The proposed approach significantly outperforms several state-of-the-art methods and almost always generates plausible pose predictions, demonstrating the usefulness of implicit learning of structures using GANs.
机译:由于其重要的应用,单眼图像中的地标/姿态估计在计算机视觉中获得了很大的努力。当输入图像具有严重的闭塞时,它仍然是一个具有挑战性的任务,例如,造成的,例如,逆势相机视图。在这种情况下,可以产生生物学上难以置信的姿态预测。相比之下,人类的视觉能够通过利用地标点相互连接的几何约束来预测姿势。为了解决问题,通过将前沿纳入姿势组件的结构,我们提出了一种新颖的结构感知完全卷积网络,以在深网络培训期间隐含地考虑这样的前瞻。明确学习这种约束通常是具有挑战性的。相反,灵感来自人类如何识别令人难以置信的姿势,我们设计鉴别者以区分真实的姿势(如生物学上难以置信的问题)。如果姿势生成器G生成结果,则鉴别器无法与真实的结果区分,网络成功地学习了前提。网络培训遵循条件生成对冲网络(GANS)的策略。建议网络的有效性在三个姿势相关的任务中评估:2D人类姿势估计,2D面部地标估计和3D人类姿态估计。该提出的方法显着优于几种最先进的方法,并且几乎总是产生合理的姿态预测,展示了使用GANS隐含结构的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号