首页> 外文学位 >A perceptual organization approach for figure completion, binocular and multiple-view stereo and machine learning using tensor voting.
【24h】

A perceptual organization approach for figure completion, binocular and multiple-view stereo and machine learning using tensor voting.

机译:一种感知组织方法,用于完成张量,双目和多视图立体声以及使用张量投票的机器学习。

获取原文
获取原文并翻译 | 示例

摘要

This works extends the tensor voting framework and addresses a wide range of problems from a perceptual organization perspective. The most important contributions are the addition of boundary inference capabilities, a novel re-formulation of the framework applicable to high-dimensional spaces and the development of algorithms for computer vision and machine learning problems. In all cases, the problem is formulated as the organization of the inputs into salient perceptual structures.;For single image analysis, we address the inference of integrated descriptions in terms of edges and keypoints in way that can be useful for higher level processes. We also address a higher level problem: figure completion. We propose a computational framework which implements both modal and amodal completion and provides a fully automatic decision making mechanism for selecting between them. We illustrate the approach on several inputs producing interpretations consistent with those of human observers.;We propose an approach for stereovision that considers both binocular and monocular cues. It allows the integration of matching candidates generated by different operators, combining their strengths. Then, perceptual organization is performed in 3-D under the assumption that correct matches, unlike wrong ones, form salient coherent surfaces. Disparity hypotheses for unmatched pixels are generated considering both geometric and photometric criteria. We also address dense, multiple-view stereo under the same assumption. Unlike other approaches, ours can process all data simultaneously, can be applied to more general camera configurations and does not require foreground/background segmentation. We were able to reconstruct scenes that provide serious challenges to state-of-the-art methods.;Finally, we present a new implementation of tensor voting that can be generalized to spaces with hundreds of dimensions, since it achieves significant reductions in storage and computational requirements. Its advantages include its applicability to a far wider range of datasets, noise robustness, absence of global computations and capability to process very large numbers of points. We present results in dimensionality and manifold orientation estimation, geodesic distance measurement, nonlinear interpolation and function approximation. This work opens the door for applications such as unsupervised classification, forward and inverse kinematics.
机译:这项工作扩展了张量投票框架,并从可感知的组织角度解决了许多问题。最重要的贡献是增加了边界推断功能,对适用于高维空间的框架进行了新颖的重新表述,并开发了计算机视觉和机器学习问题的算法。在所有情况下,问题都被表述为显性感知结构中的输入组织。对于单个图像分析,我们以边缘和关键点的方式解决了集成描述的推理,这可能对更高级别的过程很有用。我们还解决一个更高层次的问题:图形完成。我们提出了一个计算框架,该框架可以实现模态完成和非模态完成,并提供一种在它们之间进行选择的全自动决策机制。我们在几种输入上说明了与人类观察者的解释相符的方法。我们提出了一种同时考虑了双眼和单眼线索的立体视觉方法。它允许整合不同运营商生成的匹配候选者,并结合他们的优势。然后,在假设正确匹配与错误匹配不同而形成明显相干表面的假设下,以3-D执行感知组织。针对不匹配像素的视差假设是根据几何和光度学标准生成的。在相同的假设下,我们还将解决密集的多视图立体声问题。与其他方法不同,我们的方法可以同时处理所有数据,可以应用于更一般的相机配置,并且不需要前景/背景分割。最后,我们提出了张量投票的新实现,可以将其推广到具有数百个维度的空间,因为它可以显着减少存储量和存储量,从而可以重构场景。计算要求。它的优势包括适用于更广泛的数据集,噪声鲁棒性,无需全局计算以及能够处理大量点的能力。我们介绍了尺寸和流形方向估计,测地线距离测量,非线性插值和函数逼近的结果。这项工作为无监督分类,正向和反向运动学等应用打开了大门。

著录项

  • 作者

    Mordohai, Philippos.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 237 p.
  • 总页数 237
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:42:56

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号