首页> 外文会议>International Conference on Pattern Recognition and Image Analysis >Three-stream Very Deep Neural Network for Video Action Recognition
【24h】

Three-stream Very Deep Neural Network for Video Action Recognition

机译:三流超深度神经网络的视频动作识别

获取原文

摘要

The purpose of this study is to determine whether fine-tuning very deep three-dimensional Convolutional Neural Network (3D CNN) that already pre-trained on an adequately large video dataset will give sufficient motion information for action recognition or still need to have supplementary information. We introduce a three-stream CNN that is based on two-dimensional (2D) and 3D kernels while leveraging successful pre-trained networks on ImageNet and Kinetics datasets. In order to analyze these streams, we fine-tune each on the HMDB-51 challenging dataset and show that supplementary motion information (optical flow and the proposed sparse trajectory image) are critical to action recognition despite using 3D CNN. Experimental outcomes determine that our network reaches 80.92% accuracy on the HMDB-51 dataset and its performance is comparable with the performance of state-of-the-art networks on this dataset.
机译:这项研究的目的是确定是否已经对足够大的视频数据集进行了预训练的非常深的三维卷积神经网络(3D CNN)的微调将提供足够的运动信息以进行动作识别还是仍然需要补充信息。我们介绍了一种基于二维(2D)和3D内核的三流CNN,同时利用了ImageNet和Kinetics数据集上成功的预训练网络。为了分析这些流,我们在HMDB-51具有挑战性的数据集上进行了微调,并表明尽管使用3D CNN,补充运动信息(光学流和提议的稀疏轨迹图像)对于动作识别也至关重要。实验结果确定了我们的网络在HMDB-51数据集上的准确性达到80.92%,其性能可与该数据集上的最新网络的性能相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号