首页> 外文期刊>Neurocomputing >Visual phrase recognition by modeling 3D spatial context of multiple objects
【24h】

Visual phrase recognition by modeling 3D spatial context of multiple objects

机译:通过对多个对象的3D空间上下文建模来进行视觉短语识别

获取原文
获取原文并翻译 | 示例
           

摘要

Automatically recognizing the visual phrase of an image is a challenging issue in computer vision. In this paper, we propose a method to discover and identify the visual phrase by automatically analyzing 3D spatial geometric structure of an image. It includes two steps: (1) learning 3D spatial geometric model; and (2) recognizing visual phrase. To achieve the first goal, we propose 3D geometric models (3DSG) that jointly capture both the features of objects and 3D spatial layout among objects in a visual phrase. In the second step, we transform the visual phrase recognition into verification by measuring the similarity of spatial configuration between the given visual pattern and the 3DSG model. The nature of our method makes itself precisely determine whether the given visual pattern belongs to a specific 3DSG model or not by maximizing the joint probability of the given visual pattern and a 3DSG model. Experiments conducted on several datasets show that our model outperforms the state-of-the-art models in modeling 3D spatial geometric structure as well as recognizing visual phrase. The results also demonstrate that modeling 3D spatial configuration between objects can significantly improve the deeper image understanding. (C) 2017 Elsevier B.V. All rights reserved.
机译:在计算机视觉中,自动识别图像的视觉短语是一个具有挑战性的问题。在本文中,我们提出了一种通过自动分析图像的3D空间几何结构来发现和识别视觉短语的方法。它包括两个步骤:(1)学习3D空间几何模型; (2)识别视觉短语。为了实现第一个目标,我们提出了3D几何模型(3DSG),该模型可以用视觉短语共同捕获对象的特征和对象之间的3D空间布局。在第二步中,我们通过测量给定视觉模式和3DSG模型之间空间配置的相似性,将视觉短语识别转换为验证。我们方法的本质是通过最大化给定视觉模式和3DSG模型的联合概率,来精确地确定给定视觉模式是否属于特定3DSG模型。在多个数据集上进行的实验表明,在对3D空间几何结构进行建模以及识别视觉短语方面,我们的模型优于最新模型。结果还表明,对对象之间的3D空间配置建模可以显着改善对图像的更深入理解。 (C)2017 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2017年第30期|183-192|共10页
  • 作者

    Bai Lin; Chen Qingfeng;

  • 作者单位

    Guangxi Univ, Sch Comp Elect & Informat, Nanning 530004, Peoples R China;

    Guangxi Univ, Sch Comp Elect & Informat, Nanning 530004, Peoples R China|Guangxi Univ, State Key Lab Conservat & Utilizat Subtrop Agrobi, Nanning 530004, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Structured visual pattern; Visual phrase; Spatial geometric information; Spatial compatibility;

    机译:结构化视觉模式;视觉短语;空间几何信息;空间相容性;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号