【24h】

A Relational Kernel-Based Framework for Hierarchical Image Understanding

机译:基于关系内核的层次图像理解框架

获取原文

摘要

While relational representations have been popular in early work on syntactic and structural pattern recognition, they are rarely used in contemporary approaches to computer vision due to their pure symbolic nature. The recent progress and successes in combining statistical learning principles with relational representations motivates us to reinvestigate the use of such representations. More specifically, we show that statistical relational learning can be successfully used for hierarchical image understanding. We employ kLog, a new logical and relational language for learning with kernels to detect objects at different levels in the hierarchy. The key advantage of kLog is that both appearance features and rich, contextual dependencies between parts in a scene can be integrated in a principled and interpretable way to obtain a qualitative representation of the problem. At each layer, qualitative spatial structures of parts in images are detected, classified and then employed one layer up the hierarchy to obtain higher-level semantic structures. We apply a four-layer hierarchy to street view images and successfully detect corners, windows, doors, and individual houses.
机译:尽管关系表示法在语法和结构模式识别的早期工作中很流行,但由于其纯粹的符号本质,很少在当代的计算机视觉方法中使用它们。统计学习原理与关系表示相结合的最新进展和成功促使我们重新研究这种表示的使用。更具体地说,我们表明统计关系学习可以成功地用于层次图像理解。我们采用kLog(一种用于与内核一起学习的新逻辑和关系语言)来检测层次结构中不同级别的对象。 kLog的主要优势在于,可以以原则性和可解释的方式集成外观特征和场景中各个部分之间丰富的上下文相关性,以获得问题的定性表示。在每一层,对图像中各部分的定性空间结构进行检测,分类,然后在层次结构中使用一层以获得更高级别的语义结构。我们将四层层次结构应用于街景图像,并成功检测到拐角,窗户,门和独立房屋。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号