首页> 外文期刊>IEEE transactions on visualization and computer graphics >A Sampling Approach to Generating Closely Interacting 3D Pose-Pairs from 2D Annotations
【24h】

A Sampling Approach to Generating Closely Interacting 3D Pose-Pairs from 2D Annotations

机译:从2D注释产生紧密交互3D姿势对的采样方法

获取原文
获取原文并翻译 | 示例
       

摘要

We introduce a data-driven method to generate a large number of plausible, closely interacting 3D human pose-pairs, for a given motion category, e.g., wrestling or salsa dance. With much difficulty in acquiring close interactions using 3D sensors, our approach utilizes abundant existing video data which cover many human activities. Instead of treating the data generation problem as one of reconstruction, either through 3D acquisition or direct 2D-to-3D data lifting from video annotations, we present a solution based on Markov Chain Monte Carlo (MCMC) sampling. Given a motion category and a set of video frames depicting the motion with the 2D pose-pair in each frame annotated, we start the sampling with one or few seed 3D pose-pairs which are manually created based on the target motion category. The initial set is then augmented by MCMC sampling around the seeds, via the Metropolis-Hastings algorithm and guided by a probability density function (PDF) that is defined by two terms to bias the sampling towards 3D pose-pairs that are physically valid and plausible for the motion category. With a focus on efficient sampling over the space of close interactions, rather than pose spaces, we develop a novel representation called interaction coordinates (IC) to encode both poses and their interactions in an integrated manner. Plausibility of a 3D pose-pair is then defined based on the IC and with respect to the annotated 2D pose-pairs from video. We show that our sampling-based approach is able to efficiently synthesize a large volume of plausible, closely interacting 3D pose-pairs which provide a good coverage of the input 2D pose-pairs.
机译:我们介绍一种数据驱动方法,以产生大量合理的,紧密地互动3D人姿势对,用于给定的运动类别,例如摔跤或莎莎舞蹈。在使用3D传感器获取密切交互的情况下,我们的方法利用丰富的现有视频数据来涵盖许多人类活动。通过从视频注释从3D获取或直接2D到3D数据提升,而不是将数据生成问题视为重建之一,而不是从视频注释提升,而是基于Markov链蒙特卡罗(MCMC)采样的解决方案。给定运动类别和一组视频帧,这些视频帧描绘了在每个帧中的2D姿态对的运动,我们开始使用基于目标运动类别手动创建的一个或几个种子3D姿势对的采样。然后通过MCMC采样通过MCMC采样来增强初始集,通过Metropolis-Hastings算法,并由概率密度函数(PDF)引导,该概率密度函数(PDF)由两个术语偏置为物理有效和合理的3D姿势对的抽样对于运动类别。通过重点关注在密切交互的空间上的有效采样,而不是姿势空间,我们开发一种名为交互坐标(IC)的新颖表示以综合方式编码两个姿势及其交互。然后基于IC和关于来自视频的注释的2D姿势对的3D姿态对的合格性。我们表明,我们的采样方法能够有效地合成大量的合理性,紧密地相互作用,其提供了输入2D姿势对的良好覆盖范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号