首页> 外文学位 >Hypergraph based visual categorization and segmentation.
【24h】

Hypergraph based visual categorization and segmentation.

机译:基于超图的视觉分类和细分。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation explores original techniques for the construction of hypergraph models for computer vision applications. A hypergraph is a generalization of a pairwise simple graph, where an edge can connect any number of vertices. The expressive power of the hypergraph models places a special emphasis on the relationship among three or more objects, which has made hypergraphs better models of choice in a lot of problems. This is in sharp contrast with the more conventional graph representation of visual patterns where only pairwise connectivity between objects is described. The contribution of this thesis is fourfold: (i) For the first time the advantage of the hypergraph neighborhood structure is analyzed. We argue that the summarized local grouping information contained in hypergraphs causes an 'averaging' effect which is beneficial to the clustering problems, just as local image smoothing may be beneficial to the image segmentation task. (ii) We discuss how to build hypergraph incidence structures and how to solve the related unsupervised and semi-supervised problems for three different computer vision scenarios: video object segmentation, unsupervised image categorization and image retrieval. We compare our algorithms with state-of-the-art methods and the effectiveness of the proposed methods is demonstrated by extensive experimentation on various datasets. (iii) For the application of image retrieval, we propose a novel hypergraph model --- probabilistic hypergraph to exploit the structure of the data manifold by considering not only the local grouping information, but also the similarities between vertices in hyperedges. (iv) In all three applications mentioned above, we conduct an in depth comparison between simple graph and hypergraph based algorithms, which is also beneficial to other computer vision applications.
机译:本文探讨了用于计算机视觉应用的超图模型构建的原始技术。超图是成对的简单图的概括,其中一条边可以连接任意数量的顶点。超图模型的表达能力特别强调三个或更多对象之间的关系,这使超图在许多问题中成为更好的选择模型。这与视觉模式的更传统图形表示形式形成鲜明对比,在视觉图形中,仅描述了对象之间的成对连接。本文的贡献有四个方面:(i)首次分析了超图邻域结构的优势。我们认为,超图中包含的汇总局部分组信息会导致“平均”效应,这有利于聚类问题,就像局部图像平滑可能会有利于图像分割任务一样。 (ii)我们讨论了如何针对三种不同的计算机视觉场景建立超图关联结构以及如何解决相关的无监督和半监督问题:视频对象分割,无监督的图像分类和图像检索。我们将我们的算法与最先进的方法进行比较,并且通过对各种数据集进行广泛的实验证明了所提出方法的有效性。 (iii)对于图像检索的应用,我们提出了一种新颖的超图模型-概率超图,它不仅考虑局部分组信息,而且考虑超边缘中顶点之间的相似性,从而利用数据流形的结构。 (iv)在上述所有三个应用程序中,我们在基于简单图和超图的算法之间进行了深度比较,这也对其他计算机视觉应用程序有利。

著录项

  • 作者

    Huang, Yuchi.;

  • 作者单位

    Rutgers The State University of New Jersey - New Brunswick.;

  • 授予单位 Rutgers The State University of New Jersey - New Brunswick.;
  • 学科 Engineering Computer.;Computer Science.;Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号