首页> 外文学位 >Algorithmic issues in visual object recognition.
【24h】

Algorithmic issues in visual object recognition.

机译:视觉对象识别中的算法问题。

获取原文
获取原文并翻译 | 示例

摘要

This thesis is divided into two parts covering two aspects of research in the area of visual object recognition.;Part I is about human detection in still images. Human detection is a challenging computer vision task due to the wide variability in human visual appearances and body poses. In this part, we present several enhancements to human detection algorithms. First, we present an extension to the integral images framework to allow for constant time computation of non-uniformly weighted summations over rectangular regions using a bundle of integral images. Such computational element is commonly used in constructing gradient-based feature descriptors, which are the most successful in shape-based human detection. Second, we introduce deformable features as an alternative to the conventional static features used in classifiers based on boosted ensembles. Deformable features can enhance the accuracy of human detection by adapting to pose changes that can be described as translations of body features. Third, we present a comprehensive evaluation framework for cascade-based human detectors. The presented framework facilitates comparison between cascade-based detection algorithms, provides a confidence measure for result, and deploys a practical evaluation scenario.;Part II explores the possibilities of enhancing the speed of core algorithms used in visual object recognition using the computing capabilities of Graphics Processing Units (GPUs). First, we present an implementation of Graph Cut on GPUs, which achieves up to 4x speedup against compared to a CPU implementation. The Graph Cut algorithm has many applications related to visual object recognition such as segmentation and 3D point matching. Second, we present an efficient sparse approximation of kernel matrices for GPUs that can significantly speed up kernel based learning algorithms, which are widely used in object detection and recognition. We present an implementation of the Affinity Propagation clustering algorithm based on this representation, which is about 6 times faster than another GPU implementation based on a conventional sparse matrix representation.
机译:本文分为两个部分,涵盖了视觉对象识别领域的两个方面的研究。第一部分是静止图像中的人体检测。由于人类视觉外观和身体姿势的广泛差异,人类检测是一项具有挑战性的计算机视觉任务。在这一部分中,我们介绍了人类检测算法的一些增强功能。首先,我们提出了对积分图像框架的扩展,以允许使用一束积分图像对矩形区域上的非均匀加权求和进行恒定时间的计算。这种计算元素通常用于构造基于梯度的特征描述符,这在基于形状的人体检测中最为成功。其次,我们引入了可变形特征,以替代基于增强合奏的分类器中使用的常规静态特征。可变形特征可以通过适应姿势变化(可以描述为身体特征的平移)来提高人类检测的准确性。第三,我们为基于级联的人体检测器提供了一个全面的评估框架。提出的框架促进了基于级联的检测算法之间的比较,提供了结果的置信度,并部署了实际的评估方案。第二部分探讨了利用Graphics的计算能力提高视觉对象识别中使用的核心算法速度的可能性。处理单元(GPU)。首先,我们介绍了在GPU上实现Graph Cut的实现,与CPU实现相比,该实现高达4倍的加速。 Graph Cut算法具有许多与视觉对象识别相关的应用程序,例如分割和3D点匹配。其次,我们为GPU提供了一种有效的稀疏近似内核矩阵,可以显着加快基于内核的学习算法,该算法广泛用于对象检测和识别。我们提出了一种基于这种表示的亲和力传播聚类算法的实现,该算法比另一种基于常规稀疏矩阵表示的GPU实现快约6倍。

著录项

  • 作者

    Hussein, Mohamed.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 169 p.
  • 总页数 169
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号