...
首页> 外文期刊>Computer vision and image understanding >Efficient semantic image segmentation with multi-class ranking prior
【24h】

Efficient semantic image segmentation with multi-class ranking prior

机译:具有多类别优先级的高效语义图像分割

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Semantic image segmentation is of fundamental importance in a wide variety of computer vision tasks, such as scene understanding, robot navigation and image retrieval, which aims to simultaneously decompose an image into semantically consistent regions. Most of existing works addressed it as structured prediction problem by combining contextual information with low-level cues based on conditional random fields (CRFs), which are often learned by heuristic search based on maximum likelihood estimation. In this paper, we use maximum margin based structural support vector machine (S-SVM) model to combine multiple levels of cues to attenuate the ambiguity of appearance similarity and propose a novel multi-class ranking based global constraint to confine the object classes to be considered when labeling regions within an image. Compared with existing global cues, our method is more balanced between expressive power for heterogeneous regions and the efficiency of searching exponential space of possible label combinations. We then introduce inter-class co-occurrence statistics as pairwise constraints and combine them with the prediction from local and global cues based on S-SVMs framework. This enables the joint inference of labeling within an image for better consistency. We evaluate our algorithm on two challenging datasets which are widely used for semantic segmentation evaluation: MSRC-21 dataset and Stanford Background dataset and experimental results show that we obtain high competitive performance compared with state-of-the-art methods, despite that our model is much simpler and efficient.
机译:语义图像分割在各种各样的计算机视觉任务(例如场景理解,机器人导航和图像检索)中至关重要,这些任务旨在将图像同时分解为语义一致的区域。现有的大多数工作都通过将上下文信息与基于条件随机字段(CRF)的低级提示相结合来解决结构化的预测问题,通常通过基于最大似然估计的启发式搜索来学习这些提示。在本文中,我们使用基于最大余量的结构支持向量机(S-SVM)模型来组合多个级别的线索,以减轻外观相似性的歧义,并提出一种基于多类排序的全局全局约束,以将对象类限制为在标记图像中的区域时考虑的因素。与现有的全局提示相比,我们的方法在异构区域的表达能力与搜索可能的标签组合的指数空间的效率之间更加平衡。然后,我们将类间共现统计作为成对约束,并将其与基于S-SVM框架的局部和全局提示的预测相结合。这样可以联合推断图像中的标签以获得更好的一致性。我们在广泛用于语义细分评估的两个具有挑战性的数据集上评估了我们的算法:MSRC-21数据集和Stanford Background数据集,实验结果表明,尽管我们的模型与最新技术方法相比,我们仍具有较高的竞争表现更简单高效。

著录项

  • 来源
    《Computer vision and image understanding》 |2014年第3期|81-90|共10页
  • 作者单位

    Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China,Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China,State Key Laboratory of Intelligent Technology and Systems, Beijing 100084, China,Tsinghua National Laboratory for Information Science and Technology, Beijing 100084, China;

    Huawei Noah's Ark Lab, Hong Kong, China;

    Department of Cognitive Science, School of Information Science and Technology, Xiamen University, Xiamen 361005, China;

    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China,State Key Laboratory of Intelligent Technology and Systems, Beijing 100084, China,Tsinghua National Laboratory for Information Science and Technology, Beijing 100084, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Computer vision; Machine learning; Semantic segmentation; Structural SVMs;

    机译:计算机视觉;机器学习;语义分割;结构型支持向量机;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号