Realistic human action recognition: When deep learning meets VLAD

机译：现实的人体行动认可：当深度学习符合弗拉德时

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Human action recognition from realistic scenarios is extremely challenging due to large intra-class variation and complex background clutters. In this paper, by leveraging the strength of deep learning and vector of locally aggregated descriptors (VLAD), we propose a new methods for human action recognition from realistic datsets. We adopt stack convolu-tional independent subspace analysis (ISA) networks to learn 3D cuboid representation directly from spatio-temporal video data; we propose an improved VLAD by incorporating the spatio-temporal geometrical information to encode the deep learned local features. On two challenging realistic datasets: the YouTube action and HMDB51 datasets, the proposed method achieves state-of-the-art performance with an efficient linear SVM classifier, which is competitive with and even better than existing sophisticated algorithms.

机译：由于阶级内部变异和复杂的背景夹斗，人类的行动识别是极具挑战性的。在本文中，通过利用局部聚合描述符的深度学习和向量（VLAD）的强度，我们提出了一种从现实数据集的人类行动识别的新方法。我们采用Stack Compolu-Tional独立子空间分析（ISA）网络直接从时空视频数据学习3D长方体表示;我们通过结合时空几何信息来编码深度学习的本地特征来提出一种改进的V层。在两个具有挑战性的现实数据集：YouTube动作和HMDB51数据集中，该方法使用高效的线性SVM分类器实现最先进的性能，这与现有的复杂算法竞争甚至更好。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2016年||共5页
会议地点
作者
Lei Zhang; Yangyang Feng; Jiqing Han; Xiantong Zhen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信理论;
关键词
VLAD; convolutional ISA; deep learning; geometric information;

机译：VLAD;卷积ISA;深入学习;几何信息;

相似文献

外文文献
中文文献
专利

1. When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data [J] . Tang Hao, Liu Hong, Xiao Wei, Neural Networks and Learning Systems, IEEE Transactions on . 2021,第5期

机译：当字典学习符合深度学习时：具有有限数据的图像识别的深刻字典学习和编码网络
2. Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos [J] . Song Y., Zheng Y.-T., Tang S., Circuits and Systems for Video Technology, IEEE Transactions on . 2011,第9期

机译：本地化多核学习，用于视频中逼真的人类动作识别
3. Extraction and Recognition Method of Basketball Players’ Dynamic Human Actions Based on Deep Learning [J] . Qiulin Wang, Baole Tao, Fulei Han, Mobile information systems . 2021,第a期

机译：基于深度学习的篮球运动员动态人类动态的提取与识别方法
4. Realistic human action recognition: When deep learning meets VLAD [C] . Lei Zhang, Yangyang Feng, Jiqing Han, IEEE International Conference on Acoustics, Speech and Signal Processing . 2016

机译：逼真的人类动作识别：当深度学习遇到VLAD时
5. Deep Learning of Neuromuscular and Sensorimotor Control with Biomimetic Perception for Realistic Biomechanical Human Animation [D] . Nakada, Masaki. 2017

机译：深度学习神经肌肉和感觉运动控制与仿生感知的逼真的生物力学人类动画。
6. Fusion of Video and Inertial Sensing for Deep Learning–Based Human Action Recognition [O] . Haoran Wei, Roozbeh Jafari, Nasser Kehtarnavaz 2019

机译：视频和惯性传感的融合用于基于深度学习的人类动作识别
7. When Face Recognition Meets with Deep Learning: an Evaluation of Convolutional Neural Networks for Face Recognition [O] . Hu, G, Yang, Y, Yi, D, 2015

机译：当人脸识别与深度学习相遇时：用于人脸识别的卷积神经网络评估

Realistic human action recognition: When deep learning meets VLAD

摘要

著录项

相似文献

相关主题

期刊订阅