首页> 外文期刊>Image and Vision Computing >Recognizing In The Depth: Selective 3D Spatial Pyramid Matching Kernel For Object And Scene Categorization
【24h】

Recognizing In The Depth: Selective 3D Spatial Pyramid Matching Kernel For Object And Scene Categorization

机译:深度识别:用于对象和场景分类的选择性3D空间金字塔匹配内核

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a working volume into fine sub-volumes, and computing a hierarchical weighted sum of histogram intersections of visual words at each level of the 3D pyramid structure. With the aim of increasing both the classification accuracy and the computational efficiency of the kernel, we propose two selective hierarchical volume decomposition strategies, based on representative and discriminative sub-volume selection processes, which drastically reduce the pyramid to consider. Results on different RGBD datasets show that our approaches obtain state-of-the-art results for both object recognition and scene categorization.
机译:本文提出了一种新颖的方法来识别深度图像中的对象和场景类别。我们引入了3D字袋(BoW)表示法,即选择性3D空间金字塔匹配内核(3DSPMK)。它开始量化从点云计算出的3D局部描述符,以建立3D视觉单词的词汇表。该代码本用于构建3DSPMK,该3DSPMK开始将工作空间划分为精细的子空间,并在3D金字塔结构的每个级别上计算视觉单词的直方图相交的分层加权总和。为了提高分类的准确性和内核的计算效率,我们基于代表性和区分性子体积选择过程,提出了两种选择性的层次体积分解策略,从而大大减少了要考虑的金字塔。不同RGBD数据集上的结果表明,我们的方法在对象识别和场景分类方面都获得了最新的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号