IEEE/CVF Conference on Computer Vision and Pattern Recognition

Actor and Observer: Joint Modeling of First and Third-Person Videos



Abstract

Several theories in cognitive neuroscience suggest that when people interact with the world, or simulate interactions, they do so from a first-person egocentric perspective, and seamlessly transfer knowledge between the third-person (observer) and first-person (actor) views. Despite this, learning such models for human action recognition has not been achievable due to the lack of data. This paper takes a step in this direction with the introduction of Charades-Ego, a large-scale dataset of paired first-person and third-person videos, involving 112 people and 4000 paired videos. This enables learning the link between the two perspectives, actor and observer. Thereby, we address one of the biggest bottlenecks facing egocentric vision research, providing a link from first-person video to the abundant third-person data on the web. We use this data to learn a joint representation of first- and third-person videos with only weak supervision, and show its effectiveness for transferring knowledge from the third-person to the first-person domain.
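The joint representation described above can be illustrated with a triplet-style alignment objective: embed paired first- and third-person clips into a shared space, pulling each egocentric embedding toward its paired third-person embedding and away from an unpaired one. The sketch below is a minimal NumPy illustration of such a loss; the function names, margin value, and exact formulation are assumptions for exposition, not the paper's actual implementation.

```python
import numpy as np

def l2_normalize(x, eps=1e-9):
    """Project embeddings onto the unit sphere (rows are embeddings)."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def triplet_alignment_loss(ego, third_pos, third_neg, margin=0.5):
    """Hinge loss that pulls a first-person (ego) embedding toward its
    paired third-person frame and pushes it from an unpaired one.
    All arguments are (batch, dim) arrays in a shared embedding space.
    Names and margin are illustrative, not from the paper."""
    ego, third_pos, third_neg = map(l2_normalize, (ego, third_pos, third_neg))
    d_pos = np.sum((ego - third_pos) ** 2, axis=-1)  # distance to paired view
    d_neg = np.sum((ego - third_neg) ** 2, axis=-1)  # distance to unpaired view
    return np.maximum(0.0, d_pos - d_neg + margin).mean()

# Toy usage: a perfectly aligned pair with a distant negative incurs no loss.
ego = np.array([[1.0, 0.0]])
pos = np.array([[1.0, 0.0]])
neg = np.array([[-1.0, 0.0]])
print(triplet_alignment_loss(ego, pos, neg))
```

With weak supervision, "paired" here need only mean that the two clips record the same activity from the two viewpoints, which is exactly what the Charades-Ego pairing provides.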
