首页> 外文学位 >Statistical and geometric methods for visual tracking with occlusion handling and target reacquisition.
【24h】

Statistical and geometric methods for visual tracking with occlusion handling and target reacquisition.

机译:用于遮挡处理和目标重新捕获的视觉跟踪的统计和几何方法。

获取原文
获取原文并翻译 | 示例

摘要

Computer vision is the science that studies how machines understand scenes and automatically make decisions based on meaningful information extracted from an image or multi-dimensional data of the scene, like human vision. One common and well-studied field of computer vision is visual tracking. It is challenging and active research area in the computer vision community. Visual tracking is the task of continuously estimating the pose of an object of interest from the background in consecutive frames of an image sequence. It is a ubiquitous task and a fundamental technology of computer vision that provides low-level information used for high-level applications such as visual navigation, human-computer interaction, and surveillance system.;The focus of the research in this thesis is visual tracking and its applications. More specifically, the object of this research is to design a reliable tracking algorithm for a deformable object that is robust to clutter and capable of occlusion handling and target reacquisition in realistic tracking scenarios by using statistical and geometric methods. To this end, the approaches developed in this thesis make extensive use of region-based active contours and particle filters in a variational framework. In addition, to deal with occlusions and target reacquisition problems, we exploit the benefits of coupling 2D and 3D information of an image and an object.;In this thesis, first, we present an approach for tracking a moving object based on 3D range information in stereoscopic temporal imagery by combining particle filtering and geometric active contours. Range information is weighted by the proposed Gaussian weighting scheme to improve segmentation achieved by active contours. In addition, this work present an on-line shape learning method based on principal component analysis to reacquire track of an object in the event that it disappears from the field of view and reappears later. Second, we propose an approach to jointly track a rigid object in a 2D image sequence and to estimate its pose in 3D space. In this work, we take advantage of knowledge of a 3D model of an object and we employ particle filtering to generate and propagate the translation and rotation parameters in a decoupled manner. Moreover, to continuously track the object in the presence of occlusions, we propose an occlusion detection and handling scheme based on the control of the degree of dependence between predictions and measurements of the system. Third, we introduce the fast level-set based algorithm applicable to real-time applications. In this algorithm, a contour-based tracker is improved in terms of computational complexity and the tracker performs real-time curve evolution for detecting multiple windows. Lastly, we deal with rapid human motion in context of object segmentation and visual tracking. Specifically, we introduce a model-free and marker-less approach for human body tracking based on a dynamic color model and geometric information of a human body from a monocular video sequence. The contributions of this thesis are summarized as follows:;• Reliable algorithm to track deformable objects in a sequence consisting of 3D range data by combining particle filtering and statistics-based active contour models. • Effective handling scheme based on object’s 2D shape information for the challenging situations in which the tracked object is completely gone from the image domain during tracking. • Robust 2D-3D pose tracking algorithm using a 3D shape prior and particle filters on SE(3). • Occlusion handling scheme based on the degree of trust between predictions and measurements of the tracking system, which is controlled in an online fashion. • Fast level set based active contour models applicable to real-time object detection. • Model-free and marker-less approach for tracking of rapid human motion based on a dynamic color model and geometric information of a human body.
机译:计算机视觉是研究机器如何理解场景并根据从场景的图像或多维数据中提取的有意义信息(如人类视觉)自动做出决策的科学。视觉跟踪是计算机视觉的一个常见且经过充分研究的领域。它是计算机视觉社区中充满挑战的活跃研究领域。视觉跟踪是在图像序列的连续帧中从背景连续估计感兴趣对象的姿势的任务。它是无处不在的任务,是计算机视觉的基础技术,它提供了用于视觉导航,人机交互和监视系统等高层应用程序的底层信息。;本论文的研究重点是视觉跟踪及其应用。更具体地说,本研究的目的是为可变形物体设计一种可靠的跟踪算法,该算法对杂乱性很强,并且能够通过统计和几何方法在现实的跟踪场景中进行遮挡处理和目标重新捕获。为此,本文开发的方法在变分框架中广泛使用了基于区域的活动轮廓和粒子滤波器。另外,为了解决遮挡和目标重获问题,我们充分利用了将图像和物体的2D和3D信息耦合的优势。本文首先提出一种基于3D距离信息的运动物体跟踪方法。结合了粒子滤波和几何活动轮廓,在立体时空影像中获得了成功。范围信息由建议的高斯加权方案加权,以改善通过活动轮廓实现的分割。此外,这项工作提出了一种基于主成分分析的在线形状学习方法,以在物体从视野中消失并随后重新出现时重新获得物体的轨迹。其次,我们提出一种方法来共同跟踪2D图像序列中的刚性对象并估算其在3D空间中的姿态。在这项工作中,我们利用了对象3D模型的知识,并采用粒子滤波以分离的方式生成和传播平移和旋转参数。此外,要在存在遮挡的情况下连续跟踪对象,我们提出了一种基于对系统预测和测量之间的依赖程度的控制的遮挡检测和处理方案。第三,我们介绍了适用于实时应用的基于快速水平集的算法。在该算法中,基于轮廓的跟踪器在计算复杂度方面得到了改善,并且该跟踪器执行实时曲线演化以检测多个窗口。最后,我们在对象分割和视觉跟踪的背景下处理人类的快速运动。具体来说,我们引入了一种基于动态色彩模型和来自单眼视频序列的人体几何信息的人体跟踪的无模型和无标记方法。本文的主要工作概括如下:•通过结合粒子滤波和基于统计的主动轮廓模型,在由3D范围数据组成的序列中跟踪可变形物体的可靠算法。 •基于对象的2D形状信息的有效处理方案,用于在跟踪过程中被跟踪对象完全脱离图像域的挑战性情况。 •使用3D形状先验和SE(3)上的粒子过滤器的强大2D-3D姿态跟踪算法。 •基于跟踪系统的预测和测量之间的信任度的遮挡处理方案,该遮挡处理方案以在线方式进行控制。 •适用于实时物体检测的基于快速水平集的活动轮廓模型。 •基于动态颜色模型和人体几何信息的无模型和无标记方法,用于跟踪人体的快速运动。

著录项

  • 作者

    Lee, Jehoon.;

  • 作者单位

    Georgia Institute of Technology.;

  • 授予单位 Georgia Institute of Technology.;
  • 学科 Engineering Computer.;Engineering Electronics and Electrical.;Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 136 p.
  • 总页数 136
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号