首页> 外文学位 >Efficient search algorithms for finding deformable shapes.
【24h】

Efficient search algorithms for finding deformable shapes.

机译:用于查找可变形形状的高效搜索算法。

获取原文
获取原文并翻译 | 示例

摘要

This thesis investigates the problem of search and convergence complexity for algorithms which detect deformable visual objects, such as hands, in images. This problem is of great practical importance for the design of artificial vision systems and also throws light on the biology of human perception. The thesis formulates vision as a decoding problem where the goal is to determine information about the world from intensity patterns reaching the eye or camera.; Deformable template models are used to represent the objects by encoding probabilities for their shape and appearance. These models are partially specified by the user and partially learned from representative image data. Bayesian probability theory is used to synthesize shapes from these models and verify their plausibility.; The thesis gives a framework for detecting deformable shapes in terms of the A* search procedure. It proves that many current vision search algorithms, such as twenty questions, dynamic programming, and Dijkstra, are special cases of A*. Using this framework, and techniques adapted from information theory, it is possible to prove expected convergence times of search algorithms and to define a measure of search complexity for deformable shapes (analogous to an order parameter in statistical physics). These theoretical results demonstrate the existence of search algorithms with acceptable time complexity which can detect these deformable shapes.; The theory is illustrated by computer experiments using dynamic programming and A* for detecting shapes such as hands and cat ears. The experiments show that the algorithms can deal with significant shape deformations, large occlusions of the target objects, and the presence of multiple targets.
机译:本文研究了用于检测图像中可变形视觉对象(例如手)的算法的搜索和收敛复杂性问题。这个问题对于人工视觉系统的设计具有重要的现实意义,并且也为人类感知的生物学提供了启示。本文将视觉​​描述为解码问题,其目的是根据到达眼睛或相机的强度模式确定有关世界的信息。可变形模板模型用于通过编码对象形状和外观的概率来表示对象。这些模型由用户部分指定,并部分从代表图像数据中学习。贝叶斯概率理论用于从这些模型中合成形状并验证其合理性。本文提供了一种根据A *搜索过程来检测可变形形状的框架。证明了许多当前的视觉搜索算法,例如二十个问题,动态编程和Dijkstra,都是A *的特例。使用此框架以及从信息理论改编的技术,可以证明搜索算法的预期收敛时间,并可以定义可变形形状(类似于统计物理学中的阶数参数)的搜索复杂度。这些理论结果证明存在具有可接受的时间复杂度的搜索算法,该算法可以检测这些可变形的形状。通过使用动态程序设计和A *来检测形状(例如手和猫耳朵)的计算机实验说明了该理论。实验表明,该算法可以处理明显的形状变形,目标物体的较大遮挡以及多个目标的存在。

著录项

  • 作者

    Coughlan, James Martin.;

  • 作者单位

    Harvard University.;

  • 授予单位 Harvard University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 127 p.
  • 总页数 127
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号