首页> 外文会议>IEEE International Conference on Computer and Communications >A Two-Pathway Convolutional Neural Network with Temporal Pyramid Network for Action Recognition
【24h】

A Two-Pathway Convolutional Neural Network with Temporal Pyramid Network for Action Recognition

机译:具有颞金字塔网络的双向卷积神经网络,用于动作识别

获取原文

摘要

In order to solve the problem of imperfect capture of visual rhythm in action recognition, this paper proposes a novel model which is a combination of a two-pathway network and temporal pyramid networks (TPNs). Specifically, our work involves two aspects, on the one hand, we integrate TPNs into the fast pathway and the slow pathway of SlowFast network to capture multi-level features, and then merge the prediction results of the two pathways in the final recognition stage, which boosts performance of our network by enhancing the semantics extraction at input layer and feature layer. On the other hand, we apply a ConvLSTM module to improve the capability of temporal modeling in TPN, which can further strengthen the capture of features in the long-term dimensions, and the advanced TPN promotes the fusion of temporal and spatial features. Experiments on the Kinetics-400 dataset demonstrate the superiority of our novel architecture combining two-pathway network and advanced TPN in action recognition.
机译:为了解决行动识别中的视觉节奏捕获的不完美捕获问题,提出了一种新颖的模型,它是双通路网络和时间金字塔网络(TPN)的组合。具体而言,我们的工作涉及两个方面,一方面,我们将TPN集成到快速途径和节拍网络的慢速路径中,以捕获多级别特征,然后在最终识别阶段合并两个途径的预测结果,这通过增强输入层和特征层的语义提取来提高我们网络的性能。另一方面,我们应用COMMLSTM模块以提高TPN中的时间建模能力,这可以进一步加强长期尺寸中的特征的捕获,并且高级TPN促进时间和空间特征的融合。 Kinetics-400数据集的实验展示了我们的新建筑的优越性,这些架构结合了两通路网络和高级TPN在动作识别中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号