首页> 外文期刊>International Journal of Computer Vision >Robust object detection with interleaved categorization and segmentation
【24h】

Robust object detection with interleaved categorization and segmentation

机译:具有交错分类和分段的强大对象检测

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.
机译:本文提出了一种在杂乱的现实世界场景中检测和定位视觉类别对象的新颖方法。我们的方法将对象分类和图底分割视为两个紧密协作以实现共同目标的交错过程。如我们的工作所示,这两个过程之间的紧密耦合使它们能够彼此受益,并提高了综合性能。我们方法的核心部分是针对对象形状的高度灵活的学习表示形式,可以将在不同训练示例中观察到的信息结合到广义霍夫变换的概率扩展中。所得到的方法可以检测新颖图像中的分类对象,并自动从识别结果中推断出概率分割。然后,通过使系统将精力集中在目标像素上,并丢弃来自背景的误导性影响,进而将该细分用于提高识别度。此外,在基于MDL的假设验证阶段中采用了假设在图像中获得支持的信息,以解决重叠假设之间的歧义并排除部分遮挡的影响。对几个大型数据集的广泛评估表明,所提出的系统适用于一系列不同的物体类别,包括刚性物体和铰接物体。此外,它的灵活表示使其已经可以从训练集中获得竞争对象检测性能,该训练集比可比系统中使用的训练集小一到两个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号