首页> 外文会议>AAAI Conference on Artificial Intelligence >Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction
【24h】

Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction

机译:告诉我他们持有什么:弱监督的对象检测从人对象交互中具有可转移的知识

获取原文

摘要

In this work, we introduce a novel weakly supervised object detection (WSOD) paradigm to detect objects belonging to rare classes that have not many examples using transferable knowledge from human-object interactions (HOI). While WSOD shows lower performance than full supervision, we mainly focus on HOI as the main context which can strongly supervise complex semantics in images. Therefore, we propose a novel module called RRPN (relational region proposal network) which outputs an object-localizing attention map only with human poses and action verbs. In the source domain, we fully train an object detector and the RRPN with full supervision of HOI. With transferred knowledge about localization map from the trained RRPN, a new object detector can learn unseen objects with weak verbal supervision of HOI without bounding box annotations in the target domain. Because the RRPN is designed as an add-on type, we can apply it not only to the object detection but also to other domains such as semantic segmentation. The experimental results on HICO-DET dataset show the possibility that the proposed method can be a cheap alternative for the current supervised object detection paradigm. Moreover, qualitative results demonstrate that our model can properly localize unseen objects on HICO-DET and V-COCO datasets.
机译:在这项工作中,我们介绍了一种小型弱监督的对象检测(WSOD)范式,以检测属于罕见类的对象,其中使用来自人对象交互(Hoi)的可转移知识没有许多示例。虽然WSOD显示出比完全监督更低的性能,但我们主要关注会议作为可能在图像中强烈监督复杂语义的主要背景。因此,我们提出了一个名为RRPN(关系区域提议网络)的新型模块,其仅输出对象本地化注意力映射,只能与人类的姿势和动作动词。在源域中,我们完全培训了对象探测器和RRPN,全面监督了Hoi。通过从训练的RRPN传输关于本地化地图的知识,新的对象探测器可以使用目标域中的边界框注释来学习具有HOI的弱势口头监督的未经遵守的对象。由于RRPN被设计为附加类型,因此我们不仅可以应用于对象检测,而且可以应用于其他域,例如语义分割。 HiCO-DIC数据集的实验结果表明,所提出的方法可以是当前监督对象检测范式的便宜替代方案。此外,定性结果表明,我们的模型可以在Hico-DET和V-Coco数据集上妥善定位看不见的对象。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号