...
首页> 外文期刊>Neurocomputing >Spatial-aware stacked regression network for real-time 3D hand pose estimation
【24h】

Spatial-aware stacked regression network for real-time 3D hand pose estimation

机译:用于实时3D手姿势估计的空间感知堆叠回归网络

获取原文
获取原文并翻译 | 示例
           

摘要

Making full use of the spatial information of the depth data is crucial for 3D hand pose estimation from a single depth image. In this paper, we propose a Spatial-aware Stacked Regression Network (SSRN) for fast, robust and accurate 3D hand pose estimation from a single depth image. By adopting a differentiable pose re-parameterization process, our method efficiently encodes the pose-dependent 3D spatial struc-ture of the depth data as spatial-aware representations. Taking such spatial-aware representations as inputs, the stacked regression network utilizes multi-joint spatial context and the 3D spatial relationship between the estimated pose and the depth data to predict a refined hand pose. To further improve the estimation accuracy, we adopt a spatial attention mechanism to reduce the influence of irrelevant fea-tures for pose regression. In order to improve the speed of the network, we propose a cross-stage self-distillation mechanism to distill knowledge within the network itself. Experiments on four datasets show that our proposed method achieves state-of-the-art accuracy with high running speed around 330 FPS on a single GPU and 35 FPS on a single CPU.(c) 2021 Elsevier B.V. All rights reserved.
机译:充分利用深度数据的空间信息对于来自单个深度图像的3D手姿势估计至关重要。在本文中,我们提出了一种空间感知堆叠回归网络(SSRN),用于从单个深度图像的快速,鲁棒和准确的3D手姿势估计。通过采用可分辨率的姿势重新参数化进程,我们的方法有效地将深度数据的姿势依赖于依赖的3D空间结构进行了有效地编码为空间感知的表示。以这样的空间感知的表示作为输入,堆叠的回归网络利用多关节空间上下文和所估计的姿态和深度数据之间的三维空间关系来预测精制手姿态。为了进一步提高估计准确性,我们采用空间注意机制来减少对姿势回归的无关FEA的影响。为了提高网络的速度,我们提出了一种跨舞台自蒸馏机制来蒸馏在网络本身内的知识。四个数据集上的实验表明,我们的提出方法在单个GPU上的330 FPS上实现了最先进的准确度,在单个GPU上,在单个CPU上的35个FPS。(c)2021 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2021年第21期|42-57|共16页
  • 作者单位

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

    Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    3D hand pose estimation; Depth image; Knowledge distillation;

    机译:3D手姿势估计;深度图像;知识蒸馏;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号