首页> 外文学位 >Algorithms for capturing human body motion and structure.
【24h】

Algorithms for capturing human body motion and structure.

机译:捕获人体运动和结构的算法。

获取原文
获取原文并翻译 | 示例

摘要

The tasks of a typical computer vision application include: recovering scene structure, detecting/recognizing object/human, recognizing activities, responding to events, etc. These tasks involve critical computer vision problems including establishing correspondence, stereo, structure from stereo, optical flow computation, structure from motion, object/human tracking, event detection/understanding, etc. The research work in this dissertation covers four of these problems.; For solving correspondent problem, we propose a fast hierarchical stereo matching approach using discrete wavelet transform. We show that the compactly supported wavelet basis with n vanishing moments can be regarded as the nth derivative of the signal after some scaled smoothing operation. The area- and feature-based methods are then combined using the multi-resolution framework of wavelet. For computing optical flow, we propose three methods. The first is the extension of the stereo matching algorithm. The second is also based on wavelet multi-scale analysis but under the brightness constraint. A novel coarse-and-find method is designed to alleviate aliasing and error propagation problem. Both of these methods are computationally efficient. The third method achieves greater accuracy. Here, the optical flow problem is formulated as a total least square estimation based on 3D structure tensor analysis. A parametric model and an adaptive neighborhood adjustment are integrated to improve the accuracy and dynamically handle the aperture problem.; For the structure from motion (SfM) problem, we solve the nonlinear SfM system by decomposing it into two linear subsystems, a motion subsystem and a structure subsystem, in a multi-resolution framework. We prove the convergence of our algorithm. The statistical analysis shows that error variances are decreasing along the coarse-to-fine iterations when the optical flow estimation is reasonably good and the structure of the object is relatively flat.; For the tracking problem, the optical flow is implicitly used as a bridge to fill the gap between the brightness constraint and the kinematic chain. The articulated body tracking problem is then formulated as a linear system. A re-initialization scheme is proposed to facilitate system linearization. A new statistical constraint is derived to achieve more accurate tracking.
机译:典型的计算机视觉应用程序的任务包括:恢复场景结构,检测/识别对象/人,识别活动,对事件做出响应等。这些任务涉及关键的计算机视觉问题,包括建立对应关系,立体感,立体感的结构,光流计算从运动,对象/人类跟踪,事件检测/理解等结构。本文的研究工作涵盖了其中四个问题。为了解决对应问题,我们提出了一种使用离散小波变换的快速分层立体匹配方法。我们表明,具有 n 消失矩的紧支撑小波基可以被视为经过一定规模的平滑操作后信号的 n 导数。然后使用小波的多分辨率框架将基于区域和基于特征的方法进行组合。为了计算光流,我们提出了三种方法。首先是立体声匹配算法的扩展。第二种也是基于小波多尺度分析,但受亮度限制。设计了一种新颖的粗略查找方法来减轻混叠和错误传播问题。这两种方法在计算上都是有效的。第三种方法可获得更高的精度。在此,将光流问题公式化为基于3D结构张量分析的总最小二乘估计。集成了参数模型和自适应邻域调整,以提高精度并动态处理孔径问题。对于来自运动的结构(SfM)问题,我们通过在多分辨率框架中将其分解为两个线性子系统(运动子系统和结构子系统)来解决非线性SfM系统。我们证明了算法的收敛性。统计分析表明,当光流估计合理且对象的结构相对平坦时,误差的方差将沿着从粗到精的迭代减小。对于跟踪问题,光流被隐式用作桥接,以填补亮度约束和运动链之间的间隙。然后,将关节运动跟踪问题表述为线性系统。提出了一种重新初始化方案,以促进系统线性化。导出新的统计约束以实现更准确的跟踪。

著录项

  • 作者

    Liu, Haiying.;

  • 作者单位

    University of Maryland College Park.;

  • 授予单位 University of Maryland College Park.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 p.5609
  • 总页数 223
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号