IAPR Asian Conference on Pattern Recognition

Sequence-to-Sequence Learning for Human Pose Correction in Videos

Abstract

The power of ConvNets has been demonstrated in a wide variety of vision tasks, including pose estimation. However, they often produce grossly erroneous predictions in videos due to unusual poses, challenging illumination, blur, self-occlusion, and similar factors. These erroneous predictions can be refined by leveraging previous and future predictions under a temporal smoothness constraint on the video. In this paper, we present a generic approach for pose correction in videos using sequence learning that makes minimal assumptions about the sequence structure. The proposed model is generic, fast, and surpasses the state of the art on benchmark datasets. We use a generic pose estimator to obtain initial pose estimates, which are then refined by our method. The proposed architecture uses a Long Short-Term Memory (LSTM) encoder-decoder model to encode the temporal context and refine the estimates. We show a 3.7% gain over the baseline Yang & Ramanan (YR) and a 2.07% gain over the Spatial Fusion Network (SFN) on a new, challenging YouTube Pose Subset dataset.
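
The abstract does not give implementation details, so the following PyTorch sketch only illustrates the general idea it describes: an LSTM encoder-decoder that consumes a window of noisy per-frame pose estimates and emits refined ones. The joint count, layer sizes, residual output head, and MSE training loss are illustrative assumptions, not the authors' configuration.

import torch
import torch.nn as nn

class PoseCorrectionSeq2Seq(nn.Module):
    """Refines a sequence of noisy per-frame 2D pose estimates (sketch)."""
    def __init__(self, num_joints=14, hidden_size=256):
        super().__init__()
        in_dim = num_joints * 2                   # (x, y) per joint, flattened
        self.encoder = nn.LSTM(in_dim, hidden_size, batch_first=True)
        self.decoder = nn.LSTM(in_dim, hidden_size, batch_first=True)
        self.out = nn.Linear(hidden_size, in_dim)

    def forward(self, noisy_poses):
        # noisy_poses: (batch, frames, 2 * num_joints), initial estimates
        # from any off-the-shelf per-frame pose estimator.
        _, state = self.encoder(noisy_poses)      # summarize temporal context
        dec_out, _ = self.decoder(noisy_poses, state)
        # Predict residual corrections on top of the initial estimates,
        # so frames with reliable detections are left nearly unchanged.
        return noisy_poses + self.out(dec_out)

# Usage on dummy data: 8 clips of 30 frames, 14 joints each.
model = PoseCorrectionSeq2Seq()
noisy = torch.randn(8, 30, 28)
refined = model(noisy)                            # same shape as input
loss = nn.functional.mse_loss(refined, torch.randn_like(noisy))
loss.backward()

Predicting residual corrections rather than absolute poses is one common way to encode a temporal-smoothness prior; the paper's actual decoding scheme may differ.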
