...
首页> 外文期刊>International Journal of Computer Vision >New geometric methods for computer vision: An application to structure and motion estimation
【24h】

New geometric methods for computer vision: An application to structure and motion estimation

机译:计算机视觉的新几何方法:在结构和运动估计中的应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We discuss a coordinate-free approach to the geometry of computer vision problems. The technique we use to analyse the three-dimensional transformations involved will be that of geometric algebra: a framework based on the algebras of Clifford and Grassmann. This is not a system designed specifically for the task in hand, but rather a framework for all mathematical physics. Central to the power of this approach is the way in which the formalism deals with rotations; for example, if we have two arbitrary sets of vectors, known to be related via a 3D rotation, the rotation is easily recoverable if the vectors are given. Extracting the rotation by conventional means is not as straightforward. The calculus associated with geometric algebra is particularly powerful, enabling one, in a very natural way, to take derivatives with respect to any multivector (general element of the algebra). What this means in practice is that we can minimize with respect to rotors representing rotations, vectors representing translations, or any other relevant geometric quantity. This has important implications for many of the least-squares problems in computer vision where one attempts to find optimal rotations, translations etc., given observed vector quantities. We will illustrate this by analysing the problem of estimating motion from a pair of images, looking particularly at the more difficult case in which we have available only 2D information and no information on range. While this problem has already been much discussed in the literature, we believe the present formulation to be the only one in which least-squares estimates of the motion and structure are derived simultaneously using analytic derivatives. [References: 37]
机译:我们讨论了一种无坐标方法来解决计算机视觉问题的几何问题。我们用来分析涉及的三维变换的技术将是几何代数:基于Clifford和Grassmann代数的框架。这不是专门为手头任务设计的系统,而是所有数学物理的框架。这种方法强大的核心是形式主义处理轮换的方式。例如,如果我们有两个任意的向量集,已知它们通过3D旋转相关联,则只要给出向量,旋转就很容易恢复。通过常规手段提取旋转不是那么简单。与几何代数相关的演算特别强大,可以使人以非常自然的方式对任何多向量(代数的一般元素)求导数。实际上,这意味着我们可以相对于代表旋转的转子,代表平移的矢量或任何其他相关的几何量最小化。对于计算机视觉中的许多最小二乘问题,这具有重要意义,在给定观察到的向量数量的情况下,人们试图找到最佳旋转,平移等。我们将通过分析从一对图像估计运动的问题来说明这一点,特别是在更困难的情况下,其中我们仅可获得2D信息而没有距离信息。虽然这个问题已经在文献中进行了很多讨论,但我们相信本公式是唯一使用解析导数同时导出运动和结构的最小二乘估计的公式。 [参考:37]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号