首页> 外文期刊>Image and Vision Computing >Learning semantic object parts for object categorization
【24h】

Learning semantic object parts for object categorization

机译:学习语义对象部分以进行对象分类

获取原文
获取原文并翻译 | 示例

摘要

Appearance-based approaches to object recognition mostly rely on measuring the visual similarity of objects based on global or local descriptors. They have shown great success in object identification but often do not generalize to the more challenging case of object categorization, where category membership is often decided not only on a level of appearances, but also on a semantic level. It has been argued that model-based approaches are better suited to this problem, since they allow to inject high-level knowledge, for example about the constituting object parts and possible configurations. Postulating a set of object parts is problematic, though, since it is not guaranteed that those parts can be reliably extracted from real-world images. There is a need for a middle layer, forming an interface between the visual information readily available from the image and the higher-level semantic information that can be used by reasoning processes. In this work, we investigate how such an interface can be learned. As the appearance of object parts may vary considerably, this cannot be achieved by relying on visual similarity alone. Rather, this paper proposes to also use co-location and co-activation, together with weak top-down constraints, such as alignment, as guiding principles for learning the appearance of local object parts. The learned structures generalize beyond the appearance of single objects and often correspond to semantically plausible object parts, such as wheels, trunks, or windshields of cars. In a later stage, a Bayesian network of those extracted structures is used to verify object hypotheses successfully in difficult scenes.
机译:基于外观的对象识别方法主要依靠基于全局或​​局部描述符来测量对象的视觉相似性。它们在对象识别方面显示出巨大的成功,但通常不能推广到更具挑战性的对象分类情况下,在该情况下,类别成员资格不仅取决于外观级别,而且还取决于语义级别。有人认为基于模型的方法更适合于此问题,因为它们允许注入高级知识,例如有关构成对象部分和可能的配置的知识。但是,假定一组对象零件是有问题的,因为不能保证可以可靠地从真实世界的图像中提取这些零件。需要中间层,该中间层在易于从图像获得的视觉信息和可以由推理过程使用的高级语义信息之间形成接口。在这项工作中,我们研究如何学习这样的界面。由于对象零件的外观可能有很大差异,因此仅靠视觉相似性是无法实现的。而是,本文还建议使用共置位和共激活以及自上而下的弱约束(例如对齐)作为学习局部对象零件外观的指导原则。学习到的结构不仅限于单个对象的外观,而且通常对应于语义上合理的对象部分,例如车轮,行李箱或汽车的挡风玻璃。在稍后的阶段中,将这些提取的结构的贝叶斯网络用于成功验证困难场景中的对象假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号