首页> 外文会议>IEEE International Conference on Image Processing >Action Recognition with Spatial-Temporal Representation Analysis Across Grassmannian Manifold and Euclidean Space
【24h】

Action Recognition with Spatial-Temporal Representation Analysis Across Grassmannian Manifold and Euclidean Space

机译:跨格拉斯曼流形和欧氏空间的时空表示分析动作识别

获取原文

摘要

Action recognition plays an important character for numerous tasks of video area. Although previous works often learn the appearance and motion information with Convolutional Neural Networks (CNNs), they ignore the corresponding space structures of video representation. In this work, we address action recognition task with a Spatial-Temporal representation analysis algorithm Across Grassmannian manifold and Euclidean space (ST-AGE), which considers the appearance and motion information of video samples in an unified framework. For each video sample, we extract temporal features with classical CNNs (e.g., ConvNet, VGG, ResNet) and motion representation with the trajectory tracking method. Both spatial and temporal information can be then analyzed by embedding them on the Grassmannian manifold and Euclidean space, and an appropriate multi-kernel SVM is further conducted. Comprehensive evaluations on HMDB-51 and UCF-101 datasets demonstrate the significant superiority of STAGE over other state-of-the-art for human action recognition.
机译:动作识别对于视频区域的众多任务起着重要的作用。尽管以前的作品经常使用卷积神经网络(CNN)学习外观和运动信息,但他们忽略了视频表示的相应空间结构。在这项工作中,我们使用跨格拉斯曼流形和欧氏空间(ST-AGE)的时空表示分析算法来解决动作识别任务,该算法在一个统一的框架中考虑视频样本的外观和运动信息。对于每个视频样本,我们使用经典的CNN(例如ConvNet,VGG,ResNet)提取时间特征,并使用轨迹跟踪方法提取运动表示。然后,可以通过将它们嵌入格拉斯曼流形和欧几里德空间来分析时空信息,并进一步进行适当的多核SVM。对HMDB-51和UCF-101数据集的综合评估表明,STAGE优于其他最新的人类动作识别技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号