首页> 外文学位 >Activity representation from video using statistical models on shape manifolds.
【24h】

Activity representation from video using statistical models on shape manifolds.

机译:使用形状流形上的统计模型从视频中进行活动表示。

获取原文
获取原文并翻译 | 示例

摘要

Activity recognition from video data is a key computer vision problem with applications in surveillance, elderly care, etc. This problem is associated with modeling a representative shape which contains significant information about the underlying activity. In this dissertation, we represent several approaches for view-invariant activity recognition via modeling shapes on various shape spaces and Riemannian manifolds.;The first two parts of this dissertation deal with activity modeling and recognition using tracks of landmark feature points. The motion trajectories of points extracted from objects involved in the activity are used to build deformation shape models for each activity, and these models are used for classification and detection of unusual activities. In the first part of the dissertation, these models are represented by the recovered 3D deformation basis shapes corresponding to the activity using a non-rigid structure from motion formulation. We use a theory for estimating the amount of deformation for these models from the visual data. We study the special case of ground plane activities in detail because of its importance in video surveillance applications.;In the second part of the dissertation, we propose to model the activity by learning an affine invariant deformation subspace representation that captures the space of possible body poses associated with the activity. These subspaces can be viewed as points on a Grassmann manifold. We propose several statistical classification models on Grassmann manifold that capture the statistical variations of the shape data while following the intrinsic Riemannian geometry of these manifolds.;The last part of this dissertation addresses the problem of recognizing human gestures from silhouette images. We represent a human gesture as a temporal sequence of human poses, each characterized by a contour of the associated human silhouette. The shape of a contour is viewed as a point on the shape space of closed curves and, hence, each gesture is characterized and modeled as a trajectory on this shape space. We utilize the Riemannian geometry of this space to propose a template-based and a graphical-based approaches for modeling these trajectories. The two models are designed in such a way to account for the different invariance requirements in gesture recognition, and also capture the statistical variations associated with the contour data.
机译:视频数据的活动识别是计算机视觉的关键问题,在监视,老年人护理等方面的应用。此问题与建模具有代表性的形状有关,该形状包含有关基础活动的重要信息。本文通过对各种形状空间上的形状和黎曼流形进行建模,提出了几种视图不变活动识别方法。从活动中涉及的对象提取的点的运动轨迹用于为每个活动建立变形形状模型,并且这些模型用于对异常活动进行分类和检测。在论文的第一部分,这些模型由与运动对应的,使用活动公式化的非刚性结构的,对应于活动的3D变形基础形状表示。我们使用一种理论从视觉数据估计这些模型的变形量。由于其在视频监控应用中的重要性,我们详细研究了地平面活动的特殊情况。与活动相关的姿势。这些子空间可以看作是格拉斯曼流形上的点。我们在格拉斯曼流形上提出了几种统计分类模型,这些模型在捕捉形状数据的统计变化的同时遵循这些流形的内在黎曼几何。本论文的最后一部分解决了从轮廓图像中识别人的手势的问题。我们将人类手势表示为人类姿势的时间序列,每个姿势都以相关联的人类剪影的轮廓为特征。轮廓的形状被视为封闭曲线的形状空间上的一个点,因此,每个手势的特征和模型都被定义为该形状空间上的轨迹。我们利用该空间的黎曼几何来提出基于模板和基于图形的方法来对这些轨迹进行建模。设计这两种模型时,要考虑手势识别中不同的不变性要求,并且还可以捕获与轮廓数据关联的统计变化。

著录项

  • 作者

    Abdelkader, Mohamed F.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Engineering Electronics and Electrical.;Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 148 p.
  • 总页数 148
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号