首页> 外文学位 >Dynamic scene interpretation and understanding from two views.
【24h】

Dynamic scene interpretation and understanding from two views.

机译:动态场景的解读和理解有两种观点。

获取原文
获取原文并翻译 | 示例

摘要

Interpretation of a static or dynamic scene starts by segmenting the scene followed by recognition. Our work concentrates on the general problem of segmenting and recognizing animate and inanimate objects in a scene captured from two different views. The two views here refer to either a pair of frames captured by a stereo camera or two frames (with spatial overlap) captured with a moving camera.;The work described in the dissertation starts with an iterative split-and-merge framework for segmentation of an unknown number of objects captured with stereo camera. The disparity of a scene is modeled by approximating various surfaces in the scene to be planar. In the split phase, the number of planar surfaces along with the underlying plane parameters is assumed to be known from the initialization or from the previous merge phase. Based on these parameters, planar surfaces in the disparity image are labeled to minimize the residuals between the actual disparity and the modeled disparity. The labeled planar surfaces are separated into spatially continuous regions which are treated as candidates for the merging that follows. The regions are merged together under a maximum variance constraint while maximizing the merged area. A multi-stage branch-and-bound algorithm is proposed to carry out this optimization efficiently.;For moving objects, a framework is proposed for two-view multiple structure-and-motion segmentation. This segmentation problem has three unknowns namely the memberships, corresponding fundamental matrices and the number of objects. To handle this otherwise recursive problem, hypotheses for fundamental matrices are generated through local sampling. Once the hypotheses are available, a combinatorial selection problem is formulated to optimize a model selection cost which takes into account the hypotheses likelihoods and the model complexity. An explicit model for the outliers is also added for a robust model selection. The model selection cost is minimized through a branch-and-bound procedure.;Followed by segmentation, object recognition was applied to understand the scene. The segmented objects lack exact boundaries; thus shape based recognition or classification will not perform well. We follow a more general approach of visual object recognition instead. Visual object recognition relies on spatial image features to identify the objects. The state of the art visual object recognition approaches use a visual bag-of-words to represent images. Bag-of-features is an orderless collection of invariantly detectable image patches. The approach discards spatial relationships between these patches and, gives objects, their context and the background clutter equal importance. In a modification to the original visual bag-of-words, separate representations for positively and negatively relevant image patches are formed. Improvements in the classification accuracies due to the separation are demonstrated through experimentation.
机译:静态或动态场景的解释首先是对场景进行分割,然后进行识别。我们的工作集中在分割和识别从两个不同视图捕获的场景中的有生命和无生命的对象这一普遍问题。这里的两个视图是指由立体摄像机捕获的一对帧或由移动摄像机捕获的两个帧(具有空间重叠)。本文所描述的工作始于一个迭代的分割合并框架,用于分割图像。立体摄像机捕获的物体数量未知。通过将场景中的各个表面近似为平面来对场景的视差建模。在拆分阶段,假定从初始化或从先前的合并阶段知道平面的数量以及基础平面参数。基于这些参数,标记视差图像中的平面以最小化实际视差和建模视差之间的残差。标记的平面分成空间连续的区域,这些区域被视为随后合并的候选对象。在最大方差约束下将区域合并在一起,同时最大化合并区域。提出了一种多阶段分支定界算法来有效地进行优化。对于运动物体,提出了一种用于二视图多结构和运动分割的框架。该分割问题具有三个未知数,即隶属度,相应的基本矩阵和对象数。为了处理此否则为递归的问题,可通过局部采样生成基本矩阵的假设。一旦假设可用,就考虑到假设可能性和模型复杂性,制定组合选择问题以优化模型选择成本。还添加了针对异常值的显式模型,以进行健壮的模型选择。通过分支定界过程将模型选择成本降至最低。接着进行分割,然后使用对象识别来了解场景。分割的对象缺少精确的边界;因此,基于形状的识别或分类效果不佳。我们改为采用视觉对象识别的更通用方法。视觉对象识别依赖于空间图像特征来识别对象。现有技术水平的视觉对象识别方法使用视觉单词袋来表示图像。功能袋是无序检测到的图像补丁的无序集合。该方法丢弃这些补丁之间的空间关系,并赋予对象,它们的上下文和背景混乱相同的重要性。在对原始视觉单词袋的修改中,形成了正相关和负相关图像块的单独表示。通过实验证明了分离带来的分类准确性的提高。

著录项

  • 作者

    Thakoor, Ninad Shashikant.;

  • 作者单位

    The University of Texas at Arlington.;

  • 授予单位 The University of Texas at Arlington.;
  • 学科 Engineering Electronics and Electrical.;Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 138 p.
  • 总页数 138
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号