Image and Vision Computing

Action recognition using saliency learned from recorded human gaze



Abstract

This paper addresses the problem of recognition and localization of actions in image sequences by utilizing, in the training phase only, gaze-tracking data of people watching videos depicting the actions in question. First, we learn discriminative action features at the areas of gaze fixation and train a Convolutional Network that predicts areas of fixation (i.e. salient regions) from raw image data. Second, we propose a Support Vector Machine-based method for joint recognition and localization, in which the bounding box of the action in question is treated as a latent variable. In our formulation, the optimization both minimizes the classification cost and maximizes the saliency within the bounding box. We show that this joint optimization outperforms a variant that minimizes the classification cost alone, without maximizing saliency within the bounding box. Furthermore, our results outperform the state-of-the-art results on the UCF sports dataset. (C) 2016 Elsevier B.V. All rights reserved.
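The latent-variable formulation described above can be illustrated with a minimal sketch: each candidate bounding box is scored by a linear classifier response plus a saliency bonus, and latent inference picks the box maximizing the joint score. This is an assumption-laden toy (the helper names, the feature lookup, and the trade-off weight `lam` are hypothetical, not from the paper), intended only to show the shape of the objective.

```python
import numpy as np

def box_saliency(saliency_map, box):
    """Mean predicted saliency inside a candidate bounding box.

    box = (x0, y0, x1, y1); saliency_map is an HxW array in [0, 1],
    e.g. the output of the fixation-predicting network.
    """
    x0, y0, x1, y1 = box
    return float(saliency_map[y0:y1, x0:x1].mean())

def score_box(w, features, saliency_map, box, lam=0.5):
    """Joint score: classifier response on the box's feature vector
    plus lam times the box's mean saliency (lam is a hypothetical
    trade-off weight, not a value from the paper)."""
    return float(w @ features[box]) + lam * box_saliency(saliency_map, box)

def infer_box(w, features, saliency_map, candidate_boxes, lam=0.5):
    """Latent inference: select the candidate box with the highest
    joint classification-plus-saliency score."""
    return max(candidate_boxes,
               key=lambda b: score_box(w, features, saliency_map, b, lam))
```

With `lam = 0` this reduces to the classification-only baseline the abstract compares against; a positive `lam` biases localization toward regions the saliency network predicts humans would fixate.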
