首页> 外文期刊>Multimedia Tools and Applications >Towards a deep human activity recognition approach based on video to image transformation with skeleton data
【24h】

Towards a deep human activity recognition approach based on video to image transformation with skeleton data

机译:基于视频的深度人类活动识别方法与骨架数据的图像转换

获取原文
获取原文并翻译 | 示例
           

摘要

One of the most recent challenging tasks in computer vision is Human Activity Recognition (HAR), which aims to analyze and detect the human actions for the benefit of many fields such as video surveillance, behavior analysis and healthcare. Several works in the literature are based on the extraction and analysis of human skeletons in the aim of actions recognition. This paper introduces a new HAR approach based on the extraction of human skeletons from videos. Three features extraction techniques are proposed in this work. They used the extracted skeletons from the videos frames in order to construct a single image that summarizes the activity in that video. The first technique, called dynamic skeleton, is founded on the concept of dynamic images introduced in the literature, while the second one, called skeleton superposition, is based on the superposition of the extracted human skeletons in the same image. The third contribution is called body articulations and it uses only the body joints instead of the whole skeleton in order to recognize the ongoing activity. The obtained images from these three techniques are analyzed and classified using a classification system based on transfer learning principle by fine-tuning three well-known pre-trained CNNs (MobileNet, ResNet-50, VGG16). The designed system is validated and tested on two famous datasets for human activity recognition, which are RGBD-HuDact and KTH datasets. The obtained results are outstanding and proved that the implemented system outperforms the state-of-the-art approaches.
机译:计算机愿景中最近的最新挑战任务之一是人类活动识别(HAR),旨在分析和检测人类行动,以便利益许多领域,如视频监控,行为分析和医疗保健。文献中的几项作品是基于人类骷髅的提取和分析,以行动识别。本文介绍了一种基于视频的骷髅提取的新掌握方法。在这项工作中提出了三种特征提取技术。它们从视频帧中使用了提取的骨架,以便构建总结该视频中的活动的单个图像。第一技术,称为动态骨架,建立在文献中引入的动态图像的概念,而第二个称为骨架叠加,基于同一图像中提取的人骨架的叠加。第三贡献称为身体铰接,它只使用身体关节而不是整个骨架,以识别正在进行的活动。通过微调三个众所周知的预先训练的CNNS(MobileNet,Reset-50,VGG16),通过基于转移学习原理的分类系统分析和分类来自这三种技术的所得图像。设计系统在两个着名的人类活动识别数据集上进行了验证和测试,这是RGBD-HUDACT和KTH数据集。获得的结果卓越,并证明了实施的系统优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号