首页> 外文期刊>Computer vision and image understanding >Interpretation of complex scenes using dynamic tree-structure Bayesian networks
【24h】

Interpretation of complex scenes using dynamic tree-structure Bayesian networks

机译:使用动态树结构贝叶斯网络解释复杂场景

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the problem of object detection and recognition in complex scenes, where objects are partially occluded. The approach presented herein is based on the hypothesis that a careful analysis of visible object details at various scales is critical for recognition in such settings. In general, however, computational complexity becomes prohibitive when trying to analyze multiple sub-parts of multiple objects in an image. To alleviate this problem, we propose a generative-model framework-namely, dynamic tree-structure belief networks (DTSBNs). This framework formulates object detection and recognition as inference of DTSBN structure and image-class conditional distributions, given an image. The causal (Markovian) dependencies in DTSBNs allow for design of computationally efficient inference, as well as for interpretation of the estimated structure as follows: each root represents a whole distinct object, while children nodes down the sub-tree represent parts of that object at various scales. Therefore, within the DTSBN framework, the treatment and recognition of object parts requires no additional training, but merely a particular interpretation of the tree/subtree structure. This property leads to a strategy for recognition of objects as a whole through recognition of their visible parts. Our experimental results demonstrate that this approach remarkably outperforms strategies without explicit analysis of object parts.
机译:本文讨论了在物体被部分遮挡的复杂场景中的物体检测和识别问题。本文提出的方法基于这样的假设,即对各种规模的可见对象细节进行仔细分析对于这种设置的识别至关重要。但是,通常,在尝试分析图像中多个对象的多个子部分时,计算复杂性变得过高。为了缓解此问题,我们提出了一个生成模型框架,即动态树结构信念网络(DTSBN)。该框架将给定图像的对象检测和识别表示为DTSBN结构和图像类条件分布的推断。 DTSBN中的因果(马尔科夫)依存关系允许设计有效的推理,并可以按如下方式解释估计的结构:每个根代表一个完整的不同对象,而子树下的子节点代表该对象的部分。各种规模。因此,在DTSBN框架内,对对象部分的处理和识别不需要任何额外的培训,而只需对树/子树结构进行特定的解释即可。此属性导致通过识别对象的可见部分来整体识别对象的策略。我们的实验结果表明,这种方法在不显式分析对象部分的情况下明显优于策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号