...
首页> 外文期刊>Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing >Hand gesture understanding by weakly-supervised fusing shallow/deep image attributes
【24h】

Hand gesture understanding by weakly-supervised fusing shallow/deep image attributes

机译:通过弱监督融合浅/深映像属性的手势理解

获取原文
获取原文并翻译 | 示例
           

摘要

Accurately recognizing human hand gestures is a useful component in many modern intelligent systems, such as identification authentication, human computer interaction, and sign language recognition. Conventional approaches are typically based on shallow visual features and relatively simple backgrounds, which cannot readily recognize partially occluded hand gestures with sophisticated backgrounds. In this work, we propose a unified hand gesture recognition framework by optimally fusing a set of shallow/deep finger-level image attributes, based on which a weakly-supervised ranking algorithm is designed to select semantically salient regions for gesture understanding. More specifically, given a rich number of hand gesture images, we employ the well-known BING object proposal generator to extract hundreds of object patches that potentially draw human visual attention. Since the hundreds of object patches are still too many for building an effective recognition system, a weakly-supervised metric is proposed to rank them by extracting multiple shallow/deep features. And visual semantics are encoded at region-level by transferring the image-level semantic tags into various human gesture image regions by a weakly-supervised learning paradigm Apparently, the top-ranking highly salient object patches are highly indicative to human visual perception of human hand gesture, Thus we extract their ImageNet-CNN features and further concatenate them. Finally, the concatenated deep feature is fed into a multi-class SVM for classifying each hand gesture image into a particular type. Comprehensive experimental validations have demonstrated the effectiveness and robustness of our proposed hybrid-feature-based hand gesture categorization.
机译:准确地识别人类手势是许多现代智能系统中的有用组件,例如识别认证,人机交互和手语识别。常规方法通常基于浅视特征和相对简单的背景,其不能容易地识别具有复杂的背景的部分封闭的手势。在这项工作中,我们通过最佳地熔断一组浅/深级图像属性来提出统一的手势识别框架,基于该浅/深部指示级图像属性,旨在为手势理解选择语义突出区域来选择弱监督的排名算法。更具体地说,给定丰富的手势图像,我们采用了众所周知的Bing对象提案发生器来提取有数百个可能吸引人类视觉注意的物体斑块。由于数百个对象补丁来构建有效识别系统仍然太多,因此提出了一种弱监管的度量来通过提取多个浅/深度特征来对它们进行排序。通过显然将图像级语义标签转移到各种人类手势图像区域来对区域级进行了编码的,并且显然是人类视觉感知的高度突出的物体斑块的级别高度突出的对象斑块手势,从而提取他们的想象成CNN特征并进一步连接它们。最后,将连接的深度特征馈入多级SVM,用于将每个手势图像分类为特定类型。全面的实验验证已经证明了我们提出的基于混合特征的手势分类的有效性和稳健性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号