首页> 外文会议>IEEE International Conference on Systems, Man, and Cybernetics >Athlete 3D pose estimation from a monocular TV sports video using pre-trained temporal convolutional networks
【24h】

Athlete 3D pose estimation from a monocular TV sports video using pre-trained temporal convolutional networks

机译:运动员3D使用预先接受训练的时间卷积网络从单眼电视体育视频造成估计

获取原文

摘要

Our goal is to estimate athlete 3D pose from monocular TV sports video with a lower cost of collecting training data. To achieve this goal, we utilize a pre-trained deep neural network as a 3D pose estimator for estimating human 3D pose from 2D joint locations of the person in each image. Each image in popular datasets used for training such 3D pose estimator is obtained from a camera whose axis is parallel to the ground. On the other hand, since an image in TV sports video is generally taken from a bird’s eye view, joint locations of a human is distorted in the lower part of the image. Therefore, it is not appropriate to give 2D joint locations of the person directly to the pre-trained 3D pose estimator. To resolve this problem, we propose to correct 2D joint locations in an image of TV sports video by a homography transformation that maps the points in the image of TV sports video to the corresponding points in the image taken by the camera that captures training data for the 3D pose estimator. Experimental results show that the proposed method can estimate athlete 3D pose from monocular TV sports video.
机译:我们的目标是估算来自单眼电视体育视频的运动员3D姿势,收集培训数据的成本较低。为了实现这一目标,我们利用预先训练的深神经网络作为3D姿势估计器,用于估计来自每个图像中的人的2D联合位置的人3D姿势。用于训练这种3D姿势估计器的流行数据集中的每个图像是从轴与地面平行的相机获得的。另一方面,由于电视体育视频中的图像通常从鸟瞰图中取出,因此人的联合位置在图像的下部变形。因此,不合适地将该人的2D联合位置直接提供给预先训练的3D姿势估计器。要解决此问题,我们建议通过同住传播电视体育视频的图像中的2D联合位置,以便将电视体育视频图像图像中的点映射到捕获培训数据的相机拍摄的图像中的相应点3D姿势估计器。实验结果表明,该方法可以从单眼电视体育视频中估算运动员3D姿势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号