...
首页> 外文期刊>Journal of visual communication & image representation >Multimodal activity recognition with local block CNN and attention- based spatial weighted CNN
【24h】

Multimodal activity recognition with local block CNN and attention- based spatial weighted CNN

机译:用局部块CNN和基于注意力的空间加权CNN的多模式活动识别

获取原文
获取原文并翻译 | 示例
           

摘要

Deep learning based human activity recognition approach combines spatial and temporal information to complete the recognition task. The temporal information is extracted by optical flow, which is always compensated by the warping method in order to achieve better performance. However, these methods usually take the global feature as the starting point, only consider global information of video frames, and ignore local information that reflects the changes of human behavior, causing the algorithm to be sensitive to the external environment such as occlusion, illumination change. In view of the above problems, this paper fuses the local spatial features of video frames, global spatial features and temporal features to recognize different actions, and further extracts the visual attention weight to make constraint on the global spatial features. Experiments show that the algorithm proposed in this paper has better accuracy compared with the existing methods. (C) 2018 Published by Elsevier Inc.
机译:基于深度学习的人类活动识别方法结合了空间和时间信息来完成识别任务。通过光流提取时间信息,该光流量总是通过翘曲方法补偿,以便实现更好的性能。但是,这些方法通常将全局特征作为起点,只考虑视频帧的全局信息,并忽略反映人类行为变化的本地信息,导致算法对诸如遮挡等遮挡等的外部环境敏感。鉴于上述问题,本文融合了视频帧的局部空间特征,全局空间特征和时间特征来识别不同的动作,并进一步提取视觉注意力来对全局空间特征进行约束。实验表明,与现有方法相比,本文提出的算法具有更好的准确性。 (c)2018由elsevier公司出版

著录项

  • 来源
  • 作者单位

    Hangzhou Dianzi Univ Sch Comp Sci & Technol Key Lab Complex Syst Modeling & Simulat Hangzhou 310018 Zhejiang Peoples R China;

    Hangzhou Dianzi Univ Sch Comp Sci & Technol Key Lab Complex Syst Modeling & Simulat Hangzhou 310018 Zhejiang Peoples R China;

    Zhejiang Univ City Coll Artist Design & Creat Sch Dept Visual Commun Design Hangzhou 310000 Zhejiang Peoples R China;

    Hangzhou Dianzi Univ Sch Comp Sci & Technol Key Lab Complex Syst Modeling & Simulat Hangzhou 310018 Zhejiang Peoples R China;

    Beijing Univ Posts & Telecommun Sch Comp Sci Beijing Key Lab Intelligent Telecommun Software & Beijing 100876 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Activity recognition; Multimodal; Visual attention;

    机译:活动识别;多式联版;视觉关注;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号