首页> 外文学位 >Conditional models for three-dimensional human pose estimation.
【24h】

Conditional models for three-dimensional human pose estimation.

机译:三维人体姿态估计的条件模型。

获取原文
获取原文并翻译 | 示例

摘要

Human 3d pose estimation from monocular sequence is a challenging problem, owing to highly articulated structure of human body, varied anthropometry, self occlusion, depth ambiguities and large variability in the appearance and background in which humans may appear. Conventional vision based approaches to human 3d pose estimation mostly employed "top-down methods", which used a complete 3d human model, in a hypothesized pose, to explain the configuration of the humans in the observed 2d image. In this thesis, we work with "bottom-up methods" for human pose estimation, that use low level image features to directly predict 3d pose. The research draws on recent innovations in statistical learning, observation-driven modeling, stable image encodings, semi-supervised learning and learning perceptual representations. We address the problems of (a) modeling pose ambiguities due to 3d-to-2d projection and self occlusion, (b) lack of sufficient labeled data for training discriminative models and (c) high dimensionality of human 3d pose state space. In order to resolve 3d pose ambiguities, we use multi-valued functions to predict multiple plausible 3d poses for an image observation. We incorporate unlabeled data in a semi-supervised learning framework to constrain and improve the training of discriminative models. We also propose generic probabilistic Spectral Latent Variable Models to efficiently learn low dimensional representations of high dimensional observation data and apply it to the problem of human 3d pose inference.
机译:单眼序列的人类3d姿势估计是一个具有挑战性的问题,这是由于人体的高度关节结构,变化的人体测量学,自我遮挡,深度歧义以及人类可能出现的外观和背景的巨大差异。基于常规视觉的人类3d姿态估计方法大多采用“自上而下的方法”,该方法在假设的姿态下使用完整的3d人体模型来解释观察到的2d图像中人类的配置。在本文中,我们使用“自下而上的方法”进行人体姿势估计,该方法使用低级图像特征直接预测3d姿势。该研究利用了统计学习,观察驱动的建模,稳定的图像编码,半监督学习和学习感知表示方面的最新创新。我们解决了以下问题:(a)由于3d到2d投影和自我遮挡而对姿势歧义建模,(b)缺乏足够的标记数据来训练判别模型,以及(c)人类3d姿势状态空间的高维性。为了解决3d姿势的歧义,我们使用多值函数来预测图像观察的多个合理的3d姿势。我们将未标记的数据纳入半监督的学习框架中,以约束和改进判别模型的训练。我们还提出通用概率谱潜在变量模型,以有效地学习高维观测数据的低维表示并将其应用于人类3d姿态推断问题。

著录项

  • 作者

    Kanaujia, Atul.;

  • 作者单位

    Rutgers The State University of New Jersey - New Brunswick.;

  • 授予单位 Rutgers The State University of New Jersey - New Brunswick.;
  • 学科 Artificial Intelligence.Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 206 p.
  • 总页数 206
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号