首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Unsupervised Learning from Narrated Instruction Videos
【24h】

Unsupervised Learning from Narrated Instruction Videos

机译:叙述性教学视频中的无监督学习

获取原文

摘要

We address the problem of automatically learning the main steps to complete a certain task, such as changing a car tire, from a set of narrated instruction videos. The contributions of this paper are three-fold. First, we develop a new unsupervised learning approach that takes advantage of the complementary nature of the input video and the associated narration. The method solves two clustering problems, one in text and one in video, applied one after each other and linked by joint constraints to obtain a single coherent sequence of steps in both modalities. Second, we collect and annotate a new challenging dataset of real-world instruction videos from the Internet. The dataset contains about 800,000 frames for five different tasks1 that include complex interactions between people and objects, and are captured in a variety of indoor and outdoor settings. Third, we experimentally demonstrate that the proposed method can automatically discover, in an unsupervised manner, the main steps to achieve the task and locate the steps in the input videos.
机译:我们解决了从一组旁白的教学视频中自动学习完成某些任务(例如更换汽车轮胎)的主要步骤的问题。本文的贡献是三方面的。首先,我们开发一种新的无监督学习方法,该方法利用了输入视频和相关旁白的互补性。该方法解决了两个聚类问题,一个在文本中,一个在视频中,一个接一个地应用,并受到联合约束条件的链接,以在两种方式中获得一个一致的步骤序列。其次,我们从互联网上收集并注释了一个新的具有挑战性的现实世界教学视频数据集。该数据集包含用于五种不同任务的约800,000帧图像1,其中包括人与物体之间的复杂交互,并在各种室内和室外环境中捕获。第三,我们通过实验证明了该方法可以无监督地自动发现完成任务的主要步骤,并在输入视频中定位这些步骤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号