Algorithmic issues in visual object recognition.

机译：视觉对象识别中的算法问题。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This thesis is divided into two parts covering two aspects of research in the area of visual object recognition.;Part I is about human detection in still images. Human detection is a challenging computer vision task due to the wide variability in human visual appearances and body poses. In this part, we present several enhancements to human detection algorithms. First, we present an extension to the integral images framework to allow for constant time computation of non-uniformly weighted summations over rectangular regions using a bundle of integral images. Such computational element is commonly used in constructing gradient-based feature descriptors, which are the most successful in shape-based human detection. Second, we introduce deformable features as an alternative to the conventional static features used in classifiers based on boosted ensembles. Deformable features can enhance the accuracy of human detection by adapting to pose changes that can be described as translations of body features. Third, we present a comprehensive evaluation framework for cascade-based human detectors. The presented framework facilitates comparison between cascade-based detection algorithms, provides a confidence measure for result, and deploys a practical evaluation scenario.;Part II explores the possibilities of enhancing the speed of core algorithms used in visual object recognition using the computing capabilities of Graphics Processing Units (GPUs). First, we present an implementation of Graph Cut on GPUs, which achieves up to 4x speedup against compared to a CPU implementation. The Graph Cut algorithm has many applications related to visual object recognition such as segmentation and 3D point matching. Second, we present an efficient sparse approximation of kernel matrices for GPUs that can significantly speed up kernel based learning algorithms, which are widely used in object detection and recognition. We present an implementation of the Affinity Propagation clustering algorithm based on this representation, which is about 6 times faster than another GPU implementation based on a conventional sparse matrix representation.

机译：本文分为两个部分，涵盖了视觉对象识别领域的两个方面的研究。第一部分是静止图像中的人体检测。由于人类视觉外观和身体姿势的广泛差异，人类检测是一项具有挑战性的计算机视觉任务。在这一部分中，我们介绍了人类检测算法的一些增强功能。首先，我们提出了对积分图像框架的扩展，以允许使用一束积分图像对矩形区域上的非均匀加权求和进行恒定时间的计算。这种计算元素通常用于构造基于梯度的特征描述符，这在基于形状的人体检测中最为成功。其次，我们引入了可变形特征，以替代基于增强合奏的分类器中使用的常规静态特征。可变形特征可以通过适应姿势变化（可以描述为身体特征的平移）来提高人类检测的准确性。第三，我们为基于级联的人体检测器提供了一个全面的评估框架。提出的框架促进了基于级联的检测算法之间的比较，提供了结果的置信度，并部署了实际的评估方案。第二部分探讨了利用Graphics的计算能力提高视觉对象识别中使用的核心算法速度的可能性。处理单元（GPU）。首先，我们介绍了在GPU上实现Graph Cut的实现，与CPU实现相比，该实现高达4倍的加速。 Graph Cut算法具有许多与视觉对象识别相关的应用程序，例如分割和3D点匹配。其次，我们为GPU提供了一种有效的稀疏近似内核矩阵，可以显着加快基于内核的学习算法，该算法广泛用于对象检测和识别。我们提出了一种基于这种表示的亲和力传播聚类算法的实现，该算法比另一种基于常规稀疏矩阵表示的GPU实现快约6倍。

著录项

作者
Hussein, Mohamed.;
展开▼
作者单位

University of Maryland, College Park.;

展开▼
授予单位 University of Maryland, College Park.;
学科 Computer Science.
学位 Ph.D.
年度 2009
页码 169 p.
总页数 169
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Functions of the primate temporal lobe cortical visual areas in invariant visual object and face recognition. [J] . Rolls ET Neuron . 2000,第2期

机译：灵长类颞叶皮层视觉区域在不变视觉对象和面部识别中的功能。
2. Visual crowding: a fundamental limit on conscious perception and object recognition. [J] . Whitney D, Levi DM Trends in cognitive sciences . 2011,第4期

机译：视觉拥挤：对意识和对象识别的基本限制。
3. Action observation can prime visual object recognition. [J] . Helbig HB, Steinwender J, Graf M, Experimental Brain Research . 2010,第3a4期

机译：动作观察可以启动视觉对象识别。
4. Combining image-level and object-level inference for weakly supervised object recognition. Application to fisheries acoustics [C] . Lefort R., Fablet R., Karoui I., Image Processing (ICIP 2009), 2009 . 2009

机译：结合图像级和对象级推理，以实现对弱监督对象的识别。在渔业声学中的应用
5. Image features and learning algorithms for biological, generic and social object recognition. [D] . Zhang, Wei. 2009

机译：用于生物，通用和社交对象识别的图像功能和学习算法。
6. Electrical Resistance Tomography for Visualization of Moving Objects Using a Spatiotemporal Total Variation Regularization Algorithm [O] . Bo Chen, Juan F. P. J. Abascal, Manuchehr Soleimani 2018

机译：使用时空总变化正则化算法可视化电阻层析成像的运动对象
7. Special issue Visual system and image technology. 3. From perceptual characteristics to cognitive psychology. 3. Models of visual word perception and recognition. [O] . Toshio Inui 1986

机译：特殊问题视觉系统和图像技术。 3.从感知特征到认知心理学。 3.视觉词的模型感知和识别。

Algorithmic issues in visual object recognition.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅