...
首页> 外文期刊>Neurocomputing >Spatial-temporal modeling for prediction of stylized human motion
【24h】

Spatial-temporal modeling for prediction of stylized human motion

机译:用于预测程式化人体运动的时空建模

获取原文
获取原文并翻译 | 示例
           

摘要

Human motion prediction refers to forecasting human motion in the future given a past motion sequence, which has significant applications in human tracking, automatic motion generation, autonomous driving, human-robotics interaction, etc. Previous works usually used RNN-based methods, focusing on modeling the temporal dynamics of human motion, which have made great effort on content motions. However, it is unclear for their performance on stylized motion, which is with more expressive emotions and states of the human motion. Different styles within the same motion type have similar motion patterns but also subtle variances. This makes it difficult to be predicted. The main idea of this paper is to learn the spatial characteristic of stylized motion and combine it with the temporal dynamics to achieve accurate prediction. We adopt a transformer-based style encoder to learn the motion representation in the pose space and then maps it to the latent space modeled by the constant variance Gaussian mixture model; meanwhile, we use the hierarchical multi-scale RNN as a temporal encoder to capture the temporal dynamics of human motion; finally, we feed the spatial and temporal features into the prediction decoder to predict the next frame. Our experiments on the Human 3.6 M and Stylized MotionDatasets demonstrate that our model has comparable prediction performance with the state-of-the-art motion prediction works on Human 3.6 M and outperforms previous works on stylized human motion prediction. (C) 2022 Elsevier B.V. All rights reserved.
机译:人体运动预测是指根据过去的运动序列预测未来的人体运动,在人体跟踪、自动运动生成、自动驾驶、人机交互等方面具有重要应用。以往的工作通常使用基于RNN的方法,专注于对人体运动的时间动力学进行建模,在内容运动上下了很大的功夫。然而,目前尚不清楚它们在程式化运动中的表现,即更具表现力的情感和人体运动状态。同一运动类型中的不同样式具有相似的运动模式,但也有细微的变化。这使得它很难被预测。本文的主要思想是学习程式化运动的空间特征,并将其与时间动力学相结合,以实现准确的预测。我们采用基于Transformer的编码器来学习姿态空间中的运动表示,然后将其映射到由恒定方差高斯混合模型建模的潜在空间;同时,我们使用分层多尺度RNN作为时间编码器来捕捉人体运动的时间动态;最后,我们将空间和时间特征输入预测解码器以预测下一帧。我们在 Human 3.6 M 和 Stylized MotionDatasets 上的实验表明,我们的模型与 Human 3 上最先进的运动预测工作具有相当的预测性能。6 M,优于以前在程式化人体运动预测方面的工作。(c) 2022 年爱思唯尔 B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号