European Conference on Computer Vision

Predicting Gaze in Egocentric Video by Learning Task-Dependent Attention Transition



Abstract

We present a new computational model for gaze prediction in egocentric videos by exploring patterns in the temporal shift of gaze fixations (attention transition) that are dependent on egocentric manipulation tasks. Our assumption is that the high-level context of how a task is completed in a certain way has a strong influence on attention transition and should be modeled for gaze prediction in natural dynamic scenes. Specifically, we propose a hybrid model based on deep neural networks which integrates task-dependent attention transition with bottom-up saliency prediction. In particular, the task-dependent attention transition is learned with a recurrent neural network to exploit the temporal context of gaze fixations, e.g. looking at a cup after moving gaze away from a grasped bottle. Experiments on public egocentric activity datasets show that our model significantly outperforms state-of-the-art gaze prediction methods and is able to learn meaningful transitions of human attention.
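The hybrid idea in the abstract — a recurrent module that turns the history of past fixations into a task-dependent transition prior, fused with a bottom-up saliency map — can be sketched in a few lines. This is a minimal illustrative toy, not the paper's implementation: the single tanh recurrent cell stands in for the trained RNN, all weights are random placeholders, and all names (`AttentionTransitionSketch`, `step`, grid sizes) are hypothetical.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a flat vector."""
    e = np.exp(x - x.max())
    return e / e.sum()

class AttentionTransitionSketch:
    """Toy stand-in for the paper's hybrid model: a recurrent cell maps the
    fixation history to a task-dependent prior over image cells, which is
    multiplicatively fused with a bottom-up saliency map."""

    def __init__(self, grid=8, hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.grid = grid
        d = grid * grid
        # Random placeholder weights; in the paper these would be learned.
        self.W_in = rng.normal(scale=0.1, size=(hidden, d))       # fixation -> hidden
        self.W_h = rng.normal(scale=0.1, size=(hidden, hidden))   # recurrence
        self.W_out = rng.normal(scale=0.1, size=(d, hidden))      # hidden -> prior map
        self.h = np.zeros(hidden)                                  # recurrent state

    def step(self, fixation_onehot, saliency):
        """One video frame: update the fixation history, emit a gaze map."""
        self.h = np.tanh(self.W_in @ fixation_onehot + self.W_h @ self.h)
        prior = softmax(self.W_out @ self.h)        # task-dependent transition prior
        fused = prior * saliency.ravel() + 1e-12    # fuse with bottom-up saliency
        return fused / fused.sum()                  # predicted gaze distribution

# Usage: simulate a few frames with random fixations and saliency maps.
model = AttentionTransitionSketch()
d = model.grid * model.grid
rng = np.random.default_rng(1)
gaze = None
for t in range(5):
    fix = np.zeros(d)
    fix[rng.integers(d)] = 1.0                   # previous fixation as a one-hot cell
    sal = rng.random((model.grid, model.grid))   # stand-in bottom-up saliency map
    gaze = model.step(fix, sal)                  # distribution over the 8x8 grid
```

The multiplicative fusion is one simple choice; it keeps the output a proper probability distribution over grid cells while letting the recurrent prior suppress salient regions that are implausible given the task context.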
