In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person Video

机译：在情人眼中：第一人称视频中凝视与动作的共同学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the task of jointly determining what a person is doing and where they are looking based on the analysis of video captured by a headworn camera. We propose a novel deep model for joint gaze estimation and action recognition in First Person Vision. Our method describes the participant's gaze as a probabilistic variable and models its distribution using stochastic units in a deep network. We sample from these stochastic units to generate an attention map. This attention map guides the aggregation of visual features in action recognition, thereby providing coupling between gaze and action. We evaluate our method on the standard EGTEA dataset and demonstrate performance that exceeds the state-of-the-art by a significant margin of 3.5%.

机译：我们基于对头戴式摄像机拍摄的视频进行分析，共同确定一个人在做什么和在看什么的任务。我们提出了一种新颖的深度模型，用于“第一人称”视觉中的联合注视估计和动作识别。我们的方法将参与者的凝视描述为一个概率变量，并使用深度网络中的随机单位来模拟其分布。我们从这些随机单位中采样以生成注意力图。该注意图指导动作识别中视觉特征的聚集，从而提供凝视与动作之间的耦合。我们在标准EGTEA数据集上评估了我们的方法，并证明其性能比最新技术高出3.5％。

著录项

来源
《European conference on computer vision》|2018年|639-655|共17页
会议地点
作者
Yin Li; Miao Liu; James M. Rehg;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Action in the eye of the beholder: Goal-oriented gaze strategies [J] . Belardinelli Anna, Butz Martin V. Cognitive processing . 2015,第Suppla期

机译：情人眼中的行动：面向目标的注视策略
2. Understanding human activities in videos: A joint action and interaction learning approach [J] . Wang Zhenhua, Jin Jiali, Liu Tong, Neurocomputing . 2018,第DECa10期

机译：了解视频中的人类活动：联合行动和互动学习方法
3. Eye contact enhances interpersonal motor resonance: comparing video stimuli to a live two-person action context [J] . Prinsen Jellina, Alaerts Kaat Social cognitive and affective neuroscience . 2019,第9期

机译：眼睛接触增强了人际电机共鸣：将视频刺激与实时两人动作背景进行比较
4. Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization [C] . Nataliya Shapovalova, Michalis Raptis, Leonid Sigal, Annual conference on Neural Information Processing Systems . 2013

机译：行动在情人眼中：时空行动本地化的视线驱动模型
5. Visual representations in augmentative and alternative communication (AAC): An eye tracking study to explore influences of abstraction, realism, and familiarity on the gaze patterns of a person with Angelman syndrome who uses AAC technologies and implications for research and the art classroom. [D] . Allen, Nicole E. 2016

机译：补充性和替代性交流（AAC）中的视觉表示：一项眼动追踪研究，探讨抽象，现实主义和熟悉度对使用AAC技术及其对研究和艺术教室的影响的Angelman综合征患者凝视模式的影响。
6. Absorbing the gaze scattering looks: Klimt’s distinctive style and its two-fold effect on the eye of the beholder [O] . Anna Miscenà, Jozsef Arato, Raphael Rosenberg 2020

机译：吸收凝视散射看起来：Klimt的独特风格及其对旁观者眼睛的两倍效果
7. In the Eye of the Beholder: Gaze and Actions in First Person Video [O] . Yin Li, Miao Liu, Jame Rehg 2021

机译：在旁观者的眼中：凝视和第一人称视频的行动

In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person Video

摘要

著录项

相似文献

相关主题

期刊订阅