首页> 外文学位 >Motion segmentation from perspective images via rank minimization.
【24h】

Motion segmentation from perspective images via rank minimization.

机译:通过等级最小化从透视图图像进行运动分割。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, we consider the problem of segmenting data points coming from rigid bodies under perspective projection. As we all know the perspective projection camera model is the most general camera model of computer imaging, but unfortunately it is also known to be non-linear. Thus making the problem at the hand harder.;To solve this most general form of segmentation problem, we assume that we are given point correspondence data in image sequences. The data set consist of the projected coordinates of the points in 3D. The main idea of the method is to group the points according to the complexity of their motion in 3D. This idea intuitively formalizes, the fact that points from the same object would share more modes of motion and hence leading to less complex models than points from different objects.;We approached to the problem at the hand from a systems theory perspective. Estimating model order complexity and projective depth of the points is reduced to minimizing the rank of a Hankel matrix, which is constructed using the correspondence data and the unknown depths. This leads to a simple non-iterative segmentation algorithm that optimizes the use of spatial and temporal information. Since the presented algorithm exploits both spatial and temporal constraints, it does not require the estimation of the fundamental matrix and it is less sensitive to the effect of noise and outliers than approaches that rely solely on factorizations of the measurements matrix. In addition, the method can also naturally handle degenerate cases, e.g. cases where the objects partially share motion modes.;We presented and demonstrated this novel motion segmentation algorithm, tested its performance on different kinds of cases as well as comparing it with some existing algorithms.
机译:在本文中,我们考虑了在透视投影下分割来自刚体的数据点的问题。众所周知,透视投影相机模型是计算机成像中最通用的相机模型,但不幸的是,它也是非线性的。因此,使问题变得更加棘手。为了解决这种最普遍形式的分割问题,我们假设我们在图像序列中获得了点对应数据。数据集由3D点的投影坐标组成。该方法的主要思想是根据点在3D中的运动复杂程度将其分组。这个想法直观地形式化了一个事实,即来自同一对象的点将共享更多的运动模式,因此比来自不同对象的点将导致更简单的模型。;我们从系统理论的角度解决了这个问题。减少了估计模型阶数复杂度和点的投影深度,以最小化使用对应数据和未知深度构造的汉克尔矩阵的秩。这导致了一种简单的非迭代分割算法,该算法优化了空间和时间信息的使用。由于所提出的算法利用了空间和时间约束,因此不需要估计基本矩阵,并且与仅依赖于测量矩阵分解的方法相比,它对噪声和离群值的影响不那么敏感。另外,该方法还可以自然地处理退化的情况,例如我们提出并演示了这种新颖的运动分割算法,测试了其在不同情况下的性能,并将其与一些现有算法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号