Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization

Abstract

We propose a weakly-supervised structured learning approach for recognition and spatio-temporal localization of actions in video. As part of the proposed approach, we develop a generalization of the Max-Path search algorithm which allows us to efficiently search over a structured space of multiple spatio-temporal paths while also incorporating context information into the model. Instead of using spatial annotations in the form of bounding boxes to guide the latent model during training, we utilize human gaze data in the form of a weak supervisory signal. This is achieved by incorporating eye gaze, along with the classification, into the structured loss within the latent SVM learning framework. Experiments on a challenging benchmark dataset, UCF-Sports, show that our model is more accurate, in terms of classification, and achieves state-of-the-art results in localization. In addition, our model can produce top-down saliency maps conditioned on the classification label and localized latent paths.
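
The abstract mentions a generalization of the Max-Path search algorithm but gives no details of it. As background only, the sketch below illustrates the basic single-path idea that such a search builds on: given a grid of local scores per frame, find the spatio-temporal path with the highest total score by dynamic programming, where a path may start and end at any frame and may move only to a nearby cell in the next frame. This is not the authors' generalized multi-path algorithm with context. The function name `max_path`, the `radius` parameter, and the nested-list score layout are assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Cell = Tuple[int, int, int]  # (frame, row, col)


def max_path(scores: List[List[List[float]]], radius: int = 1) -> Tuple[float, List[Cell]]:
    """Illustrative single-path search (hypothetical helper, not the paper's code).

    scores[t][r][c] is a local score for cell (r, c) in frame t.
    Returns the best total score and the cells of the best path.
    """
    n_frames, n_rows, n_cols = len(scores), len(scores[0]), len(scores[0][0])
    # acc[t][r][c]: best total of any path that ends at (t, r, c)
    acc = [[[0.0] * n_cols for _ in range(n_rows)] for _ in range(n_frames)]
    # prev[t][r][c]: predecessor cell, or None if the path starts at (t, r, c)
    prev: List[List[List[Optional[Cell]]]] = [
        [[None] * n_cols for _ in range(n_rows)] for _ in range(n_frames)
    ]
    best_score, best_end = float("-inf"), None

    for t in range(n_frames):
        for r in range(n_rows):
            for c in range(n_cols):
                # Extend a path from frame t-1 only if it adds a positive total;
                # otherwise start a new path here (Kadane-style rule).
                carry, origin = 0.0, None
                if t > 0:
                    for pr in range(max(0, r - radius), min(n_rows, r + radius + 1)):
                        for pc in range(max(0, c - radius), min(n_cols, c + radius + 1)):
                            if acc[t - 1][pr][pc] > carry:
                                carry, origin = acc[t - 1][pr][pc], (t - 1, pr, pc)
                acc[t][r][c] = scores[t][r][c] + carry
                prev[t][r][c] = origin
                if acc[t][r][c] > best_score:
                    best_score, best_end = acc[t][r][c], (t, r, c)

    # Walk predecessor links back from the best end cell.
    path: List[Cell] = []
    cell = best_end
    while cell is not None:
        path.append(cell)
        t, r, c = cell
        cell = prev[t][r][c]
    path.reverse()
    return best_score, path


# Toy example: a high-scoring region drifting one column per frame.
toy = [
    [[1.0, -1.0, -1.0]],
    [[-1.0, 2.0, -1.0]],
    [[-1.0, -1.0, 3.0]],
]
total, cells = max_path(toy)
```

In this toy input the search follows the positive scores diagonally across the three frames. In a real system the local scores would come from a learned classifier applied to video regions; how the paper computes them is not stated in the abstract.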

Bibliographic details

Source
Annual conference on Neural Information Processing Systems | 2013 | 9 pages
Authors
Nataliya Shapovalova; Michalis Raptis; Leonid Sigal; Greg Mori
Original format: PDF
Chinese Library Classification: Information processing
