首页> 外文会议>International Joint Conference on Neural Networks >Adversarial Action Data Augmentation for Similar Gesture Action Recognition
【24h】

Adversarial Action Data Augmentation for Similar Gesture Action Recognition

机译:对类似手势动作识别的对抗动作数据增强

获取原文

摘要

Human gestures are unique for recognizing and describing human actions, and video-based human action recognition techniques are effective solutions to varies real-world applications, such as surveillance, video indexing, and human-computer interaction. Most existing video human action recognition approaches either using handcraft features from the frames or deep learning models such as convolutional neural networks (CNN) and recurrent neural networks (RNN); however, they have mostly overlooked the similar gestures between different actions when processing the frames into the models. The classifiers suffer from similar features extracted from similar gestures, which are unable to classify the actions in the video streams. In this paper, we propose a novel framework with generative adversarial networks (GAN) to generate the data augmentation for similar gesture action recognition. The contribution of our work is tri-fold: 1) we proposed a novel action data augmentation framework (ADAF) to enlarge the differences between the actions with very similar gestures; 2) the framework can boost the classification performance either on similar gesture action pairs or the whole dataset; 3) experiments conducted on both KTH and UCF101 datasets show that our data augmentation framework boost the performance on both similar gestures actions as well as the whole dataset compared with baseline methods such as 2DCNN and 3DCNN.
机译:人类手势识别和描述人类行为唯一的,基于视频的人体动作识别技术是有效的解决方案,以改变现实世界的应用,如监控,视频索引和人机交互。大多数现有的视频人类动作识别或者使用来自帧或深的学习模式,如卷积神经网络(CNN)和递归神经网络(RNN)的手工方法的特点;然而,处理帧入模型时,他们大多忽略了不同的动作之间的相似手势。该分类从相似的手势,这是无法在视频流的操作进行分类提取类似的功能受到影响。在本文中,我们提出一种具有生成对抗网络(GAN)的新型框架,以产生类似的手势动作识别数据增强。我们工作的贡献是三方面:1)我们提出了一种新的动作数据增强框架(ADAF)放大具有非常相似的手势动作之间的差异; 2)的框架可以提高上类似手势动作对或整个数据集的分类性能; 3)在两个KTH和UCF101数据集进行的实验表明,我们的数据增强框架提振两个类似的手势操作以及整个数据集与基线的方法,如2DCNN和3DCNN相比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号