首页> 外文期刊>Image Processing, IEEE Transactions on >Large-Scale Weakly Supervised Object Localization via Latent Category Learning
【24h】

Large-Scale Weakly Supervised Object Localization via Latent Category Learning

机译:通过潜在类别学习的大规模弱监督对象定位

获取原文
获取原文并翻译 | 示例
       

摘要

Localizing objects in cluttered backgrounds is challenging under large-scale weakly supervised conditions. Due to the cluttered image condition, objects usually have large ambiguity with backgrounds. Besides, there is also a lack of effective algorithm for large-scale weakly supervised localization in cluttered backgrounds. However, backgrounds contain useful latent information, e.g., the sky in the aeroplane class. If this latent information can be learned, object-background ambiguity can be largely reduced and background can be suppressed effectively. In this paper, we propose the latent category learning (LCL) in large-scale cluttered conditions. LCL is an unsupervised learning method which requires only image-level class labels. First, we use the latent semantic analysis with semantic object representation to learn the latent categories, which represent objects, object parts or backgrounds. Second, to determine which category contains the target object, we propose a category selection strategy by evaluating each category’s discrimination. Finally, we propose the online LCL for use in large-scale conditions. Evaluation on the challenging PASCAL Visual Object Class (VOC) 2007 and the large-scale imagenet large-scale visual recognition challenge 2013 detection data sets shows that the method can improve the annotation precision by 10% over previous methods. More importantly, we achieve the detection precision which outperforms previous results by a large margin and can be competitive to the supervised deformable part model 5.0 baseline on both data sets.
机译:在大范围的弱监督条件下,将对象定位在混乱的背景中具有挑战性。由于图像条件混乱,对象通常与背景具有很大的歧义。此外,在杂乱背景中也缺乏有效的大规模弱监督定位算法。但是,背景包含有用的潜在信息,例如飞机级的天空。如果能够学习该潜在信息,则可以大大降低物体背景的模糊性,并且可以有效地抑制背景。在本文中,我们提出了在大规模混乱情况下的潜在类别学习(LCL)。 LCL是一种无监督的学习方法,仅需要图像级别的类标签。首先,我们使用带有语义对象表示的潜在语义分析来学习潜在类别,这些类别表示对象,对象部分或背景。其次,要确定哪个类别包含目标对象,我们通过评估每个类别的区分度来提出类别选择策略。最后,我们提出了适用于大规模条件的在线LCL。对具有挑战性的PASCAL视觉对象类(VOC)2007和大规模imagenet大规模视觉识别挑战2013检测数据集的评估表明,该方法可将注释精度比以前的方法提高10%。更重要的是,我们实现的检测精度大大优于先前的结果,并且在两个数据集上都可以与监督型可变形零件模型5.0基线竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号