首页> 外文会议>IEEE International Conference on Multimedia and Expo >SKELETON-BASED ACTION RECOGNITION WITH SYNCHRONOUS LOCAL AND NON-LOCAL SPATIO-TEMPORAL LEARNING AND FREQUENCY ATTENTION
【24h】

SKELETON-BASED ACTION RECOGNITION WITH SYNCHRONOUS LOCAL AND NON-LOCAL SPATIO-TEMPORAL LEARNING AND FREQUENCY ATTENTION

机译:基于骨架的动作识别,具有同步本地和非本地时空学习和频率关注

获取原文

摘要

Benefiting from its succinctness and robustness, skeleton-based action recognition has recently attracted much attention. Most existing methods utilize local networks (e.g. recurrent, convolutional. and graph convolutional networks) to extract spatio-temporal dynamics hierarchically. As a consequence, the local and non-local dependencies. which contain more details and semantics respectively, are asynchronously captured in different level of layers. Moreover, existing methods are limited to the spatio-temporal domain and ignore information in the frequency domain. To better extract synchronous detailed and semantic information from multi-domains, we propose a residual frequency attention (rFA) block to focus on discriminative patterns in the frequency domain, and a synchronous local and non-local (SLnL) block to simultaneously capture the details and semantics in the spatio-temporal domain. Besides, a soft-margin focal loss (SMFL) is proposed to optimize the learning whole process. which automatically conducts data selection and encourages intrinsic margins in classifiers. Our approach significantly outperforms other state-of-the-art methods on several large-scale datasets.
机译:从其简洁和坚固性中受益,基于骨架的动作识别最近引起了很多关注。大多数现有方法利用本地网络(例如,经常性,卷积的。和图表卷积网络)分层提取时空动态。因此,本地和非本地依赖项。其中包含更多详细信息和语义,在不同的层次中异步捕获。此外,现有方法仅限于时空域并忽略频域中的信息。为了更好地从多个域中提取同步详细和语义信息,我们提出了剩余频率的注意力(RFA)块,以聚焦频域中的判别模式,以及同步本地和非本地(SLNL)块以同时捕获细节和时空域中的语义。此外,提出了一种柔软的焦点损失(SMFL)来优化学习整个过程。它自动进行数据选择并鼓励分类器中的内在边缘。我们的方法在几个大型数据集上显着优于其他最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号