Detecting and Recognizing Human-Object Interactions

Abstract

To understand the visual world, a machine must not only recognize individual object instances but also how they interact. Humans are often at the center of such interactions and detecting human-object interactions is an important practical and scientific problem. In this paper, we address the task of detecting (human, verb, object) triplets in challenging everyday photos. We propose a novel model that is driven by a human-centric approach. Our hypothesis is that the appearance of a person - their pose, clothing, action - is a powerful cue for localizing the objects they are interacting with. To exploit this cue, our model learns to predict an action-specific density over target object locations based on the appearance of a detected person. Our model also jointly learns to detect people and objects, and by fusing these predictions it efficiently infers interaction triplets in a clean, jointly trained end-to-end system we call InteractNet. We validate our approach on the recently introduced Verbs in COCO (V-COCO) and HICO-DET datasets, where we show quantitatively compelling results.
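The abstract describes two ideas: an action-specific density over where the target object should be, predicted from the detected person, and a fusion of that density with the person and object detections to score (human, verb, object) triplets. The sketch below illustrates the idea in plain Python. The abstract does not give the formulas, so everything here is an assumption made for illustration: the Gaussian form of the density, the relative box encoding, the value `sigma=0.3`, the product used for fusion, and all function names (`encode_offset`, `target_density`, `triplet_score`, `best_object`) are hypothetical and are not taken from the InteractNet code.

```python
import math


def encode_offset(person_box, object_box):
    """Encode an object box relative to a person box.

    Boxes are (x1, y1, x2, y2). The encoding (center shift scaled by the
    person's size, log size ratio) is an assumed, common choice.
    """
    pw, ph = person_box[2] - person_box[0], person_box[3] - person_box[1]
    ow, oh = object_box[2] - object_box[0], object_box[3] - object_box[1]
    px, py = person_box[0] + pw / 2, person_box[1] + ph / 2
    ox, oy = object_box[0] + ow / 2, object_box[1] + oh / 2
    return ((ox - px) / pw, (oy - py) / ph, math.log(ow / pw), math.log(oh / ph))


def target_density(offset, predicted_mean, sigma=0.3):
    """Unnormalized Gaussian density centered on the predicted target location.

    predicted_mean stands in for the action-specific location a model would
    predict from the person's appearance; sigma is an assumed constant.
    """
    sq_dist = sum((o - m) ** 2 for o, m in zip(offset, predicted_mean))
    return math.exp(-sq_dist / (2 * sigma ** 2))


def triplet_score(person_score, object_score, action_score, density):
    """Fuse the individual predictions by multiplication (assumed fusion rule)."""
    return person_score * object_score * action_score * density


def best_object(person_box, person_score, action_score, predicted_mean, objects):
    """Return (index, score) of the best-scoring candidate object.

    objects is a list of (box, detection_score) pairs.
    """
    scored = []
    for index, (box, object_score) in enumerate(objects):
        density = target_density(encode_offset(person_box, box), predicted_mean)
        scored.append(
            (index, triplet_score(person_score, object_score, action_score, density))
        )
    return max(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    person = (0, 0, 100, 200)
    candidates = [((100, 50, 150, 100), 0.9), ((400, 400, 450, 450), 0.95)]
    # Pretend the model predicted the first candidate's location for this action.
    mean = encode_offset(person, candidates[0][0])
    print(best_object(person, 0.8, 0.7, mean, candidates))
```

In this toy example the second candidate has a higher detection score, but it lies far from the predicted target location, so its density is close to zero and the first candidate wins. That is the role the abstract assigns to the person-conditioned density: it narrows down which detected object the person is interacting with.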

Publication details

Source
IEEE/CVF Conference on Computer Vision and Pattern Recognition | 2018 | pp. 8359-8367 | 9 pages
Conference location: Salt Lake City (US)
Authors
Georgia Gkioxari; Ross Girshick; Piotr Dollár; Kaiming He;
Original format: PDF
Keywords
Feature extraction; Visualization; Object detection; Predictive models; Task analysis; Target recognition; Image recognition;

Date indexed: 2022-08-26 14:35:33

