首页> 外文会议>International Conference on Automatic Face and Gesture Recognition >Recognizing Gestures from Videos using a Network with Two-branch Structure and Additional Motion Cues
【24h】

Recognizing Gestures from Videos using a Network with Two-branch Structure and Additional Motion Cues

机译:使用具有双分支结构和其他运动提示的网络识别来自视频的手势

获取原文

摘要

In this paper, we propose a method for recognizing gestures from videos which implicitly incorporates multimodal data during training, and makes classification by only using RGB modality data. The network is 3d-convolutional, and includes a shared network for implicitly incorporating multiple modalities, a generation branch for estimating motion regions and a classification branch for classifying gestures. We introduce a type of efficient modality data, binarized motion cues, which include information of moving hand regions, and are learned by using the generation network. The binarized motion cues are given as extra supervision for learning motion in the generation branch. Since features of additional motion cues learned by the generation branch are implicitly fused with features learned by the classification branch, the classification performance can be improved. Experimental results showed that the shared network can extract more discriminable intermediate features, and the network with the classification branch can achieve improved performance by only using RGB modality input data.
机译:在本文中,我们提出了一种用于识别来自视频中隐式融合多模式数据的视频的手势,并仅使用RGB模态数据进行分类。该网络是3D卷积的,并且包括用于隐式地结合多个模态的共享网络,用于估计运动区域的生成分支和用于对手势进行分类分支。我们介绍了一种类型的有效的模态数据,二值化运动提示包括移动手区域的信息,并通过使用生成网络来学习。二值化运动提示作为在生成分支中学习运动的额外监督。由于生成分支学习的附加运动提示的特征是隐式地与分类分支学习的特征融合,因此可以提高分类性能。实验结果表明,共享网络可以提取更可分辨力的中间特征,并且具有分类分支的网络可以仅通过使用RGB模态输入数据来实现改进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号