首页> 外文会议>European conference on computer vision >Unsupervised Learning of Video Representations via Dense Trajectory Clustering
【24h】

Unsupervised Learning of Video Representations via Dense Trajectory Clustering

机译:通过密集的轨迹聚类无监督学习视频表示

获取原文

摘要

This paper addresses the task of unsupervised learning of representations for action recognition in videos. Previous works proposed to utilize future prediction, or other domain-specific objectives to train a network, but achieved only limited success. In contrast, in the relevant field of image representation learning, simpler, discrimination-based methods have recently bridged the gap to fully-supervised performance. We first propose to adapt two top performing objectives in this class -instance recognition and local aggregation, to the video domain. In particular, the latter approach iterates between clustering the videos in the feature space of a network and updating it to respect the cluster with a non-parametric classification loss. We observe promising performance, but qualitative analysis shows that the learned representations fail to capture motion patterns, grouping the videos based on appearance. To mitigate this issue, we turn to the heuristic-based IDT descriptors, that were manually designed to encode motion patterns in videos. We form the clusters in the IDT space, using these descriptors as a an unsupervised prior in the iterative local aggregation algorithm. Our experiments demonstrates that this approach outperform prior work on UCF101 and HMDB51 action recognition benchmarks. We also qualitatively analyze the learned representations and show that they successfully capture video dynamics.
机译:本文涉及无监督学习的视频在视频中的行动认可陈述的任务。以前的作品建议利用未来的预测,或其他具体的具体目标培训网络,但只取得了有限的成功。相比之下,在图像表示学习的相关领域,更简单,基于鉴别的方法最近遍历了完全监督的性能。我们首先建议在这个类 - 最高识别和本地聚合中调整两个顶级的表演目标,以及视频域。特别地,后一种方法迭代在网络的特征空间中的视频之间迭代并更新其以尊重与非参数分类损失的群集。我们观察有希望的性能,但定性分析表明,所学习的表示未能捕获运动模式,根据外观对视频进行分组。为了缓解此问题,我们转向基于启发式的IDT描述符,该描述是手动设计用于编码视频中的运动模式。我们在IDT空间中形成群集,使用这些描述符作为迭代本地聚合算法中的一个无人监督。我们的实验表明,这种方法在UCF101和HMDB51动作识别基准上占此胜过。我们还定性地分析了学习的表示,并表明他们成功捕获了视频动态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号