首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >Few-Shot Learning of Video Action Recognition Only Based on Video Contents
【24h】

Few-Shot Learning of Video Action Recognition Only Based on Video Contents

机译:仅基于视频内容的视频动作识别的少量学习

获取原文

摘要

The success of video action recognition based on Deep Neural Networks (DNNs) is highly dependent on a large number of manually labeled videos. In this paper, we introduce a supervised learning approach to recognize video actions with very few training videos. Specifically, we propose Temporal Attention Vectors (TAVs) which adapt various length videos to preserve the temporal information of the entire video. We evaluate the TAVs on UCF101 and HMDB51. Without training any deep 3D or 2D frame feature extractors on video datasets (only pre-trained on ImageNet), the TAVs only introduce 2.1M parameters but outperforms the state-of-the-art video action recognition benchmarks with very few labeled training videos (e.g. 92% on UCF101 and 59% on HMDB51, with 10 and 8 training videos per class, respectively). Furthermore, our approach can still achieve competitive results on full datasets (97.1% on UCF101 and 77% on HMDB51).
机译:基于深度神经网络(DNN)的视频动作识别的成功高度依赖于大量手动标记的视频。在本文中,我们介绍了一种监督学习方法,以很少的训练视频来识别视频动作。具体来说,我们提出了时间注意向量(TAV),这些向量适用于各种长度的视频,以保留整个视频的时间信息。我们评估UCF101和HMDB51上的TAV。在不对视频数据集进行任何深度3D或2D帧特征提取器训练的情况下(仅在ImageNet上进行了预先训练),TAV仅引入了2.1M参数,但在标记的训练视频很少的情况下,其性能却超过了最新的视频动作识别基准(例如UCF101的92%和HMDB51的59%,每节课分别有10和8个培训视频。此外,我们的方法仍然可以在完整的数据集上获得竞争性的结果(UCF101上为97.1%,HMDB51上为77%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号