
Binge Watching: Scaling Affordance Learning from Sitcoms


Abstract

In recent years, there has been a renewed interest in jointly modeling perception and action. At the core of this investigation is the idea of modeling affordances. However, when it comes to predicting affordances, even state-of-the-art approaches still do not use ConvNets. Why is that? Unlike semantic or 3D tasks, there still does not exist any large-scale dataset for affordances. In this paper, we tackle the challenge of creating one of the biggest datasets for learning affordances. We use seven sitcoms to extract a diverse set of scenes and the ways actors interact with different objects in those scenes. Our dataset consists of more than 10K scenes and 28K ways humans can interact with these 10K images. We also propose a two-step approach to predict affordances in a new scene. In the first step, given a location in the scene, we classify which of 30 pose classes is the likely affordance pose. Given the pose class and the scene, we then use a Variational Autoencoder (VAE) [23] to extract the scale and deformation of the pose. The VAE allows us to sample the distribution of possible poses at test time. Finally, we show the importance of large-scale data in learning a generalizable and robust model of affordances.
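To make the two-step pipeline concrete, below is a minimal PyTorch sketch of the inference path: a classifier scores the 30 pose classes at a scene location, and a conditional VAE decoder, sampled from its prior at test time, produces the pose's scale and keypoint deformation. This is not the authors' implementation; the feature sizes, latent size, 17-keypoint pose layout, and all module names are illustrative assumptions.

# A minimal sketch (not the authors' code) of the two-step affordance
# approach from the abstract: (1) classify one of 30 pose classes at a
# scene location, (2) sample a conditional VAE to get scale/deformation.
# Feature sizes, latent size, and the 17-keypoint layout are assumptions.
import torch
import torch.nn as nn

NUM_POSE_CLASSES = 30   # pose classes, per the abstract
FEAT_DIM = 512          # assumed scene/location feature size
LATENT_DIM = 32         # assumed VAE latent size
POSE_DIM = 2 * 17 + 1   # assumed: 17 (x, y) keypoint offsets + one scale

class PoseClassifier(nn.Module):
    # Step 1: score the 30 pose classes from a scene-location feature.
    def __init__(self):
        super().__init__()
        self.head = nn.Linear(FEAT_DIM, NUM_POSE_CLASSES)

    def forward(self, feat):            # feat: (B, FEAT_DIM)
        return self.head(feat)          # logits: (B, 30)

class ConditionalVAEDecoder(nn.Module):
    # Step 2: decode a latent sample, conditioned on the scene feature
    # and pose class, into the pose's scale and keypoint deformation.
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(LATENT_DIM + FEAT_DIM + NUM_POSE_CLASSES, 256),
            nn.ReLU(),
            nn.Linear(256, POSE_DIM),
        )

    def forward(self, z, feat, class_onehot):
        return self.net(torch.cat([z, feat, class_onehot], dim=1))

@torch.no_grad()
def predict_affordance(feat, classifier, decoder, n_samples=5):
    # Test-time inference: pick the most likely pose class, then draw
    # several pose hypotheses by sampling the VAE prior z ~ N(0, I).
    cls = classifier(feat).argmax(dim=1)                          # (B,)
    onehot = nn.functional.one_hot(cls, NUM_POSE_CLASSES).float()
    poses = []
    for _ in range(n_samples):
        z = torch.randn(feat.size(0), LATENT_DIM)
        poses.append(decoder(z, feat, onehot))
    return cls, torch.stack(poses, dim=1)   # (B,), (B, n_samples, POSE_DIM)

# Usage with random inputs:
# feat = torch.randn(1, FEAT_DIM)
# cls, pose_samples = predict_affordance(feat, PoseClassifier(), ConditionalVAEDecoder())

Drawing several latent samples per location reflects the abstract's point that the VAE yields a distribution over plausible poses rather than a single deterministic prediction.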
