首页> 外文学位 >Computational methods for perceptual organization and object recognition.
【24h】

Computational methods for perceptual organization and object recognition.

机译:用于感知组织和对象识别的计算方法。

获取原文
获取原文并翻译 | 示例

摘要

Perceptual organization, the initial organization of an image into meaningful components, is a fundamental topic in computer vision. In this thesis, I focus my research on developing computational models for perceptual organization and its utility in shape-based object recognition from images.;For perceptual organization, I introduce a novel global contour criterion for guiding image segmentation. For this criterion, instead of modeling boundaries by local discontinuity, I propose to use the global derivative of a region-based criterion with respect to the entire contour of a segment. For several existing region-based criteria, including MDL and NCuts, I show theoretically and experimentally that this global derivative criterion is related to the simultaneous contrast effect of visual psychology and provides important complementary information to the original region-based criterion. Incorporating this global discontinuity criterion significantly improves the performance.;For the utility of perceptual organization, I propose to use image segmentations to represent and compare shapes. Matching segmentations based on mutual information allows accurate shape matching between images and achieves efficiency by avoiding the need for computing point-to-point edge correspondences. To address the unreliability of low-level image segmentations, a Bayesian technique is developed to estimate the average matching score of all possible segmentations of the compared images. This Bayesian averaging technique also has implications for perceptual organization. Based upon it, I develop a novel image segmentation algorithm that computes the mean segmentation of an image. This technique gives better consistency and predictability in computing image segmentations. I also introduce a new perspective on structure preserving image smoothing that smooths images according to their global optimal structure, i.e. the mean segmentation.;Promising results are shown in our extensive experiments in object detection, shape-based tracking, image smoothing, and image segmentation.
机译:感知组织是将图像初步组织成有意义的组成部分,是计算机视觉的基本主题。在本文中,我的研究重点是开发用于感知组织的计算模型及其在基于图像的基于形状的对象识别中的实用性。对于感知组织,我介绍了一种新颖的全局轮廓准则来指导图像分割。对于此准则,我建议不要使用基于区域准则的全局导数相对于线段的整个轮廓来对局部不连续进行建模。对于包括MDL和NCuts在内的几种现有的基于区域的标准,我从理论上和实验上表明,该全局派生标准与视觉心理学的同时对比效应有关,并且为原始的基于区域的标准提供了重要的补充信息。合并此全局不连续性准则可显着提高性能。对于感知组织的实用性,我建议使用图像分割来表示和比较形状。基于互信息的匹配分割可实现图像之间的精确形状匹配,并通过避免计算点对点边缘对应关系的需求来实现效率。为了解决低级图像分割的不可靠性,开发了贝叶斯技术以估计比较图像的所有可能分割的平均匹配分数。这种贝叶斯平均技术也对感知组织有影响。基于此,我开发了一种新颖的图像分割算法,可以计算图像的平均分割。该技术在计算图像分割时具有更好的一致性和可预测性。我还介绍了一种结构保持图像平滑的新观点,该结构可以根据图像的全局最佳结构(即均值分割)对图像进行平滑处理;我们在对象检测,基于形状的跟踪,图像平滑和图像分割的广泛实验中显示了有希望的结果。

著录项

  • 作者

    Wang, Hongzhi.;

  • 作者单位

    Stevens Institute of Technology.;

  • 授予单位 Stevens Institute of Technology.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 87 p.
  • 总页数 87
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术 ;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号