首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Realistic human action recognition: When deep learning meets VLAD
【24h】

Realistic human action recognition: When deep learning meets VLAD

机译:现实的人体行动认可:当深度学习符合弗拉德时

获取原文

摘要

Human action recognition from realistic scenarios is extremely challenging due to large intra-class variation and complex background clutters. In this paper, by leveraging the strength of deep learning and vector of locally aggregated descriptors (VLAD), we propose a new methods for human action recognition from realistic datsets. We adopt stack convolu-tional independent subspace analysis (ISA) networks to learn 3D cuboid representation directly from spatio-temporal video data; we propose an improved VLAD by incorporating the spatio-temporal geometrical information to encode the deep learned local features. On two challenging realistic datasets: the YouTube action and HMDB51 datasets, the proposed method achieves state-of-the-art performance with an efficient linear SVM classifier, which is competitive with and even better than existing sophisticated algorithms.
机译:由于阶级内部变异和复杂的背景夹斗,人类的行动识别是极具挑战性的。在本文中,通过利用局部聚合描述符的深度学习和向量(VLAD)的强度,我们提出了一种从现实数据集的人类行动识别的新方法。我们采用Stack Compolu-Tional独立子空间分析(ISA)网络直接从时空视频数据学习3D长方体表示;我们通过结合时空几何信息来编码深度学习的本地特征来提出一种改进的V层。在两个具有挑战性的现实数据集:YouTube动作和HMDB51数据集中,该方法使用高效的线性SVM分类器实现最先进的性能,这与现有的复杂算法竞争甚至更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号