Scaling Egocentric Vision: The EPIC-KITCHENS Dataset

Abstract

First-person vision is gaining interest as it offers a unique viewpoint on people's interaction with objects, their attention, and even intention. However, progress in this challenging domain has been relatively slow due to the lack of sufficiently large datasets. In this paper, we introduce EPIC-KITCHENS, a large-scale egocentric video benchmark recorded by 32 participants in their native kitchen environments. Our videos depict non-scripted daily activities: we simply asked each participant to start recording every time they entered their kitchen. Recording took place in 4 cities (in North America and Europe) by participants belonging to 10 different nationalities, resulting in highly diverse cooking styles. Our dataset features 55h of video consisting of 11.5M frames, which we densely labelled for a total of 39.6K action segments and 454.3K object bounding boxes. Our annotation is unique in that we had the participants narrate their own videos (after recording), thus reflecting true intention, and we crowd-sourced ground-truths based on these. We describe our object, action and anticipation challenges, and evaluate several baselines over two test splits, seen and unseen kitchens.

Bibliographic details

Source
European Conference on Computer Vision, 2018, pp. 753-771 (19 pages)
Authors
Dima Damen; Hazel Doughty; Giovanni Maria Farinella; Sanja Fidler; Antonino Furnari; Evangelos Kazakos; Davide Moltisanti; Jonathan Munro; Toby Perrett; Will Price; Michael Wray
Original format: PDF
Keywords
Egocentric vision; Dataset; Benchmarks; First-person vision; Egocentric object detection; Action recognition and anticipation

Similar literature

1. EGO-CH: Dataset and fundamental tasks for visitors behavioral understanding using egocentric vision [J]. Ragusa Francesco, Furnari Antonino, Battiato Sebastiano. Pattern Recognition Letters. 2020, issue Mara
2. Remembering object position in the absence of vision: Egocentric, allocentric, and egocentric decentred frames of reference [J]. Coluccia E, Mammarella IC, De Beni R. Perception. 2007, issue 6
3. Is that my hand? An egocentric dataset for hand disambiguation [J]. Cruz Sergio, Chan Antoni. Image and Vision Computing. 2019, issue Sepa
4. Extending Egocentric Vision into Vehicles: Malaysian Dash-Cam Dataset [C]. Mahamat Moussa, Chern Hong Lim, KokSheik Wong. International Conference on Intelligent Robotics and Applications. 2020
5. The Egocentric Orientation Scale: Different facets of egocentric cognition [D]. Choch, Michelle. 2007
6. A large-scale solar dynamics observatory image dataset for computer vision applications [O]. Ahmet Kucuk, Juan M. Banda, Rafal A. Angryk. 2017
7. Multi-Dataset, Multitask Learning of Egocentric Vision Tasks [O]. Georgios Kapidis, Ronald Poppe, Remco C. Veltkamp. 2021
8. Spatial vision within egocentric and exocentric frames of reference [R]. Howard, Ian P. 1991
