...
首页> 外文期刊>Multimedia Tools and Applications >Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition
【24h】

Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition

机译:用于RGB-D手持对象识别的特定于模式的和分层的特征学习

获取原文
获取原文并翻译 | 示例
           

摘要

Hand-held object recognition is an important research topic in image understanding and plays an essential role in human-machine interaction. With the easily available RGB-D devices, the depth information greatly promotes the performance of object segmentation and provides additional channel information. While how to extract a representative and discriminating feature from object region and efficiently take advantage of the depth information plays an important role in improving hand-held object recognition accuracy and eventual human-machine interaction experience. In this paper, we focus on a special but important area called RGB-D hand-held object recognition and propose a hierarchical feature learning framework for this task. First, our framework learns modality-specific features from RGB and depth images using CNN architectures with different network depth and learning strategies. Secondly a high-level feature learning network is implemented for a comprehensive feature representation. Different with previous works on feature learning and representation, the hierarchical learning method can sufficiently dig out the characteristics of different modal information and efficiently fuse them in a unified framework. The experimental results on HOD dataset illustrate the effectiveness of our proposed method.
机译:手持物体识别是图像理解中的重要研究课题,在人机交互中起着至关重要的作用。使用容易获得的RGB-D设备,深度信息极大地提高了对象分割的性能并提供了额外的通道信息。如何从对象区域中提取出具有代表性和区别性的特征并有效利用深度信息,对于提高手持式对象识别的准确性和最终的人机交互体验起着重要的作用。在本文中,我们专注于一个特殊但重要的领域,即RGB-D手持对象识别,并为此任务提出了一种层次特征学习框架。首先,我们的框架使用具有不同网络深度和学习策略的CNN架构,从RGB和深度图像中学习特定于模式的功能。其次,实现高级特征学习网络以实现全面的特征表示。与以前关于特征学习和表示的工作不同,分层学习方法可以充分挖掘出不同模态信息的特征,并有效地将它们融合在一个统一的框架中。 HOD数据集的实验结果证明了该方法的有效性。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2017年第3期|4273-4290|共18页
  • 作者单位

    Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China;

    Ningxia Univ, Sch Math & Comp Sci, Ningxia 750021, Peoples R China;

    Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China;

    Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China|Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao, Shandong, Peoples R China;

    Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China;

    Lenovo Corp Res, Beijing 100085, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Feature learning; RGB-D object recogntion; Multiple modalities;

    机译:特征学习;RGB-D目标识别;多种形式;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号