European Conference on Computer Vision

Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video



Abstract

We address the challenging task of anticipating human-object interaction in first person videos. Most existing methods either ignore how the camera wearer interacts with objects or simply consider body motion as a separate modality. In contrast, we observe that intentional hand movement reveals critical information about the future activity. Motivated by this observation, we adopt intentional hand movement as a feature representation, and propose a novel deep network that jointly models and predicts the egocentric hand motion, interaction hotspots, and future action. Specifically, we consider the future hand motion as the motor attention, and model this attention using probabilistic variables in our deep model. The predicted motor attention is further used to select the discriminative spatio-temporal visual features for predicting actions and interaction hotspots. We present extensive experiments demonstrating the benefit of the proposed joint model. Importantly, our model produces new state-of-the-art results for action anticipation on both the EGTEA Gaze+ and EPIC-Kitchens datasets.
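The core mechanism the abstract describes, using a predicted attention map to soft-select discriminative spatio-temporal features before classification, can be illustrated with a minimal sketch. This is not the paper's implementation; the function names and the use of a simple softmax-weighted pooling over flattened feature locations are illustrative assumptions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of scalars."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, attention_logits):
    """Soft-select spatio-temporal features with a motor-attention map.

    features: one feature vector (list of floats) per spatio-temporal
        location, i.e. T*H*W vectors for a flattened video feature grid.
    attention_logits: one predicted attention score per location.
    Returns a single pooled feature vector that downstream heads
    (action anticipation, interaction hotspots) could consume.
    """
    weights = softmax(attention_logits)
    dim = len(features[0])
    pooled = [0.0] * dim
    for w, feat in zip(weights, features):
        for d in range(dim):
            pooled[d] += w * feat[d]
    return pooled


# Two locations: attention strongly favors the first, so the pooled
# vector is dominated by the first location's features.
feats = [[1.0, 0.0], [0.0, 1.0]]
pooled = attention_pool(feats, [10.0, -10.0])
```

In the joint model described above, the attention logits would come from a learned (probabilistic) motor-attention branch rather than being fixed; the pooling step itself stays the same.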

