IEEE/CVF Conference on Computer Vision and Pattern Recognition

Where and Why are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks



Abstract

This paper addresses a new problem - jointly inferring human attention, intentions, and tasks from videos. Given an RGB-D video where a human performs a task, we answer three questions simultaneously: 1) where the human is looking - attention prediction; 2) why the human is looking there - intention prediction; and 3) what task the human is performing - task recognition. We propose a hierarchical model of human-attention-object (HAO) which represents tasks, intentions, and attention under a unified framework. A task is represented as sequential intentions which transition to each other. An intention is composed of the human pose, attention, and objects. A beam search algorithm is adopted for inference on the HAO graph to output the attention, intention, and task results. We built a new video dataset of tasks, intentions, and attention. It contains 14 task classes, 70 intention categories, 28 object classes, 809 videos, and approximately 330,000 frames. Experiments show that our approach outperforms existing approaches.
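The abstract names beam search as the inference procedure on the HAO graph but gives no implementation detail. Below is a minimal sketch of what such sequence decoding could look like, assuming per-segment intention scores have already been computed (in the paper these would come from the pose-attention-object potentials) and an intention-transition matrix has been learned; `beam_search_intentions`, `emission`, and `transition` are illustrative names, not the authors' API.

```python
import numpy as np

def beam_search_intentions(emission, transition, beam_width=5):
    """Decode a likely intention sequence for a video.

    emission:   (T, K) array of log-scores for each of K intention
                categories at each of T video segments (assumed
                precomputed from pose, attention, and object cues).
    transition: (K, K) array of log-scores for moving from intention
                i to intention j between consecutive segments.
    Returns (best_score, best_sequence).
    """
    T, K = emission.shape
    # Each beam entry is (cumulative log-score, intention sequence so far).
    beams = sorted(
        ((emission[0, k], [k]) for k in range(K)),
        key=lambda b: b[0], reverse=True,
    )[:beam_width]

    for t in range(1, T):
        candidates = []
        for score, seq in beams:
            prev = seq[-1]
            for k in range(K):
                new_score = score + transition[prev, k] + emission[t, k]
                candidates.append((new_score, seq + [k]))
        # Keep only the beam_width highest-scoring partial sequences.
        candidates.sort(key=lambda b: b[0], reverse=True)
        beams = candidates[:beam_width]

    return max(beams, key=lambda b: b[0])

# Toy usage with random scores, sized to the dataset's 70 intention
# categories (the real scores would come from the trained HAO model).
rng = np.random.default_rng(0)
emission = np.log(rng.dirichlet(np.ones(70), size=100))  # 100 segments
transition = np.log(rng.dirichlet(np.ones(70), size=70))
score, seq = beam_search_intentions(emission, transition, beam_width=10)
```

Because the beam keeps several partial hypotheses alive, a locally weak intention label can still survive if the transition model favors it later, which is the usual reason to prefer beam search over greedy frame-by-frame decoding on a graph like HAO.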
