Athlete 3D pose estimation from a monocular TV sports video using pre-trained temporal convolutional networks

机译：运动员3D使用预先接受训练的时间卷积网络从单眼电视体育视频造成估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Our goal is to estimate athlete 3D pose from monocular TV sports video with a lower cost of collecting training data. To achieve this goal, we utilize a pre-trained deep neural network as a 3D pose estimator for estimating human 3D pose from 2D joint locations of the person in each image. Each image in popular datasets used for training such 3D pose estimator is obtained from a camera whose axis is parallel to the ground. On the other hand, since an image in TV sports video is generally taken from a bird’s eye view, joint locations of a human is distorted in the lower part of the image. Therefore, it is not appropriate to give 2D joint locations of the person directly to the pre-trained 3D pose estimator. To resolve this problem, we propose to correct 2D joint locations in an image of TV sports video by a homography transformation that maps the points in the image of TV sports video to the corresponding points in the image taken by the camera that captures training data for the 3D pose estimator. Experimental results show that the proposed method can estimate athlete 3D pose from monocular TV sports video.

机译：我们的目标是估算来自单眼电视体育视频的运动员3D姿势，收集培训数据的成本较低。为了实现这一目标，我们利用预先训练的深神经网络作为3D姿势估计器，用于估计来自每个图像中的人的2D联合位置的人3D姿势。用于训练这种3D姿势估计器的流行数据集中的每个图像是从轴与地面平行的相机获得的。另一方面，由于电视体育视频中的图像通常从鸟瞰图中取出，因此人的联合位置在图像的下部变形。因此，不合适地将该人的2D联合位置直接提供给预先训练的3D姿势估计器。要解决此问题，我们建议通过同住传播电视体育视频的图像中的2D联合位置，以便将电视体育视频图像图像中的点映射到捕获培训数据的相机拍摄的图像中的相应点3D姿势估计器。实验结果表明，该方法可以从单眼电视体育视频中估算运动员3D姿势。

著录项

来源
《IEEE International Conference on Systems, Man, and Cybernetics》|2020年|2615-2620|共6页
会议地点
作者
Tomoka Murakami; Takayuki Nakamura;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
3D pose estimation; sports video analysis; homography;

机译：3D姿势估计;体育视频分析;众同;

相似文献

外文文献
中文文献
专利

1. Multi-person 3D pose estimation from 3D cloud data using 3D convolutional neural networks [J] . Vasileiadis Manolis, Bouganis Christos-Savvas, Tzovaras Dimitrios Computer vision and image understanding . 2019,第AUGa期

机译：使用3D卷积神经网络从3D云数据进行多人3D姿势估计
2. Multi-person 3D pose estimation from 3D cloud data using 3D convolutional neural networks [J] . Vasileiadis Manolis, Bouganis Christos-Savvas, Tzovaras Dimitrios Computer vision and image understanding . 2019,第Auga期

机译：使用3D卷积神经网络3D云数据的多人3D姿态估计
3. 3D Head pose estimation and camera mouse implementation using a monocular video camera - Springer [J] . Masoomeh Nabati, Alireza Behrad Signal, Image and Video Processing . 2015,第1期

机译：使用单眼摄像机的3D头部姿势估计和摄像机鼠标实现-Springer
4. Athlete Pose Estimation from Monocular TV Sports Footage [C] . Fastovets Mykyta, Guillemaut Jean-Yves, Hilton Adrian IEEE Conference on Computer Vision and Pattern Recognition Workshops . 2013

机译：单眼电视体育镜头中的运动员姿势估计
5. Internal and External Feature Engineering Applied to Deep Learning with Convolutional Neural Networks for Monocular Relative Pose Estimation in Visual Odometry and Self-Localization [D] . Parkins, Franz Payton. 2020

机译：内部和外部特征工程应用于卷积神经网络的深度学习，用于视觉测量和自定位中的单眼相对姿态估计
6. Can pre-trained convolutional neural networks be directly used as a feature extractor for video-based neonatal sleep and wake classification? [O] . Muhammad Awais, Xi Long, Bin Yin, 2020

机译：可以预先训练的卷积神经网络直接用作基于视频的新生儿睡眠和唤醒分类的特征提取器吗？
7. Enhanced 3D Human Pose Estimation from Videos by Using Attention-Based Neural Network with Dilated Convolutions [O] . Ruixu Liu, Ju Shen, He Wang, 2021

机译：通过使用带有扩张卷积的关注的神经网络来增强视频的3D人类姿态估算

Athlete 3D pose estimation from a monocular TV sports video using pre-trained temporal convolutional networks

摘要

著录项

相似文献

相关主题

期刊订阅