首页> 外文OA文献 >Ensemble One-dimensional Convolution Neural Networks for Skeleton-based Action Recognition
【2h】

Ensemble One-dimensional Convolution Neural Networks for Skeleton-based Action Recognition

机译:基于骨架的集合一维卷积神经网络  行动认可

摘要

In this paper, we proposed a effective but extensible residualone-dimensional convolution neural network as base network, based on the thisnetwork, we proposed four subnets to explore the features of skeleton sequencesfrom each aspect. Given a skeleton sequences, the spatial information areencoded into the skeleton joints coordinate in a frame and the temporalinformation are present by multiple frames. Limited by the skeleton sequencerepresentations, two-dimensional convolution neural network cannot be useddirectly, we chose one-dimensional convolution layer as the basic layer. Eachsub network could extract discriminative features from different aspects. Ourfirst subnet is a two-stream network which could explore both temporal andspatial information. The second is a body-parted network, which could gainmicro spatial features and macro temporal features. The third one is anattention network, the main contribution of which is to focus the key framesand feature channels which high related with the action classes in a skeletonsequence. One frame-difference network, as the last subnet, mainly processesthe joints changes between the consecutive frames. Four subnets ensembletogether by late fusion, the key problem of ensemble method is each subnetshould have a certain performance and between the subnets, there are diversityexisting. Each subnet shares a wellperformance basenet and differences betweensubnets guaranteed the diversity. Experimental results show that the ensemblenetwork gets a state-of-the-art performance on three widely used datasets.
机译:本文提出了一个有效但可扩展的残差一维卷积神经网络作为基础网络,在该网络的基础上,提出了四个子网,从各个方面探讨骨架序列的特征。给定骨架序列,将空间信息编码为一帧中的骨架关节坐标,并且通过多个帧呈现时间信息。受骨架序列表示的限制,二维卷积神经网络不能直接使用,我们选择一维卷积层作为基本层。每个子网络都可以从不同方面提取区分特征。我们的第一个子网是两个流的网络,可以同时浏览时间和空间信息。第二个是身体分割的网络,可以获取微观空间特征和宏观时间特征。第三个是注意力网络,其主要作用是集中与骨架序列中的动作类别高度相关的关键帧和特征通道。一个帧差网络作为最后一个子网,主要处理连续帧之间的关节变化。四个子网由后期融合在一起,集成方法的关键问题是每个子网应具有一定的性能,并且子网之间存在多样性。每个子网共享一个性能良好的子网,并且子网之间的差异确保了多样性。实验结果表明,该集成网络在三个广泛使用的数据集上具有最先进的性能。

著录项

  • 作者

    Xu Yangyang; Wang Lei;

  • 作者单位
  • 年度 2018
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号