首页> 外文会议>AAAI Conference on Artificial Intelligence >Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction

【24h】

Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction

机译：告诉我他们持有什么：弱监督的对象检测从人对象交互中具有可转移的知识

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we introduce a novel weakly supervised object detection (WSOD) paradigm to detect objects belonging to rare classes that have not many examples using transferable knowledge from human-object interactions (HOI). While WSOD shows lower performance than full supervision, we mainly focus on HOI as the main context which can strongly supervise complex semantics in images. Therefore, we propose a novel module called RRPN (relational region proposal network) which outputs an object-localizing attention map only with human poses and action verbs. In the source domain, we fully train an object detector and the RRPN with full supervision of HOI. With transferred knowledge about localization map from the trained RRPN, a new object detector can learn unseen objects with weak verbal supervision of HOI without bounding box annotations in the target domain. Because the RRPN is designed as an add-on type, we can apply it not only to the object detection but also to other domains such as semantic segmentation. The experimental results on HICO-DET dataset show the possibility that the proposed method can be a cheap alternative for the current supervised object detection paradigm. Moreover, qualitative results demonstrate that our model can properly localize unseen objects on HICO-DET and V-COCO datasets.

机译：在这项工作中，我们介绍了一种小型弱监督的对象检测（WSOD）范式，以检测属于罕见类的对象，其中使用来自人对象交互（Hoi）的可转移知识没有许多示例。虽然WSOD显示出比完全监督更低的性能，但我们主要关注会议作为可能在图像中强烈监督复杂语义的主要背景。因此，我们提出了一个名为RRPN（关系区域提议网络）的新型模块，其仅输出对象本地化注意力映射，只能与人类的姿势和动作动词。在源域中，我们完全培训了对象探测器和RRPN，全面监督了Hoi。通过从训练的RRPN传输关于本地化地图的知识，新的对象探测器可以使用目标域中的边界框注释来学习具有HOI的弱势口头监督的未经遵守的对象。由于RRPN被设计为附加类型，因此我们不仅可以应用于对象检测，而且可以应用于其他域，例如语义分割。 HiCO-DIC数据集的实验结果表明，所提出的方法可以是当前监督对象检测范式的便宜替代方案。此外，定性结果表明，我们的模型可以在Hico-DET和V-Coco数据集上妥善定位看不见的对象。

著录项

来源
《AAAI Conference on Artificial Intelligence》|2020年|11101-11790p|共8页
会议地点
作者
Daesik Kim; Gyujeong Lee; Jisoo Jeong; Nojun Kwak;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Human-object interaction detection with missing objects [J] . Kogashi Kaen, Wu Yang, Nobuhara Shohei, Image and Vision Computing . 2021,第Sepa期

机译：用缺失对象的人体对象交互检测
2. Multi-stream Network for Human-object Interaction Detection [J] . Wang Chang, Sun Jinyu, Ma Shiwei, International Journal of Pattern Recognition and Artificial Intelligence . 2021,第8期

机译：用于人对象交互检测的多流网络
3. Interact as You Intend: Intention-Driven Human-Object Interaction Detection [J] . Xu Bingjie, Li Junnan, Wong Yongkang, IEEE transactions on multimedia . 2020,第6期

机译：当您打算互动：有意驱动的人体对象交互检测
4. Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction [C] . Daesik Kim, Gyujeong Lee, Jisoo Jeong, AAAI Conference on Artificial Intelligence . 2020

机译：告诉我他们持有什么：弱监督的对象检测从人对象交互中具有可转移的知识
5. Exploring Human-Object Interaction Detection [D] . Bergstrom, Trevor. 2020

机译：探索人对象交互检测
6. Scaling Human-Object Interaction Recognition in the Video through Zero-Shot Learning [O] . Vali Ollah Maraghi, Karim Faez 2021

机译：通过零射击学习将人类对象交互识别缩放
7. Transferable Interactiveness Knowledge for Human-Object Interaction Detection [O] . Yong-Lu Li, Xinpeng Liu, Xiaoqian Wu, 2021

机译：可转移的人体对象交互检测的相互作用知识
8. Report to Respondents on Knowledge-Holding Studies: Descriptive Statistics for Knowledge-Holding Studies of S.E. Utah, S.W. Colorado/N.W. New Mexico, and Colorado Plateau Opinion Leaders [R] . Lamb, B. L., Ponds, P. D. 1999

机译：向知识持有研究的受访者汇报：s.E. Utah，s.W。Colorado / N.W. New mexico和Colorado plateau Opinion Leaders的知识持有研究的描述性统计

Tell Me What They're Holding: Weakly-Supervised Object Detection with Transferable Knowledge from Human-Object Interaction

摘要

著录项

相似文献

相关主题

期刊订阅