首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Rethinking Temporal-Related Sample for Human Action Recognition
【24h】

Rethinking Temporal-Related Sample for Human Action Recognition

机译:重新思考与时间相关的人类动作识别样本

获取原文

摘要

Temporal-related samples always have huge intra-class appearance variation, on which lots of existing action recognition algorithms have poor performance. In this paper, our motivation is to address this issue by utilizing temporal information more effectively. A novel light-weight Voting-based Temporal Correlation module (VTC) is proposed to enhance temporal cues. VTC integrates sparse temporal sampling strategy into feature sequences, so it mitigates the effect of redundant information and focuses more on temporal modeling. Furthermore, we propose a simple and intuitive Similarity Loss (SL) to guide the training procedure for VTC. Introducing confusion in the predicted vector intentionally, SL eases intra-class variation by discovering class-specific common motion pattern rather than sample-specific discriminative information. Combining VTC and SL with complementary advances in this field, we clearly outperform state-of-the-art results on HMDB51, UCF101, and Something-something-v1 dataset. The code has been made publicly available on https://github.com/FingerRec/TRS.
机译:与时间相关的样本始终具有巨大的类内外观变化,在这种情况下,许多现有的动作识别算法均具有较差的性能。在本文中,我们的动机是通过更有效地利用时间信息来解决此问题。提出了一种新颖的基于轻量投票的时间相关模块(VTC)来增强时间提示。 VTC将稀疏的时间采样策略集成到了特征序列中,因此它减轻了冗余信息的影响,并将更多的精力集中在时间建模上。此外,我们提出了一种简单直观的相似度损失(SL)来指导VTC的训练过程。 SL故意在预测矢量中引入混淆,它通过发现特定于类的通用运动模式而不是特定于样本的判别信息来减轻类内差异。将VTC和SL与该领域的互补优势相结合,我们明显优于HMDB51,UCF101和Something-something-v1数据集上的最新结果。该代码已在https://github.com/FingerRec/TRS上公开提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号