【24h】

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

机译:随着多样性 - 代表性奖励的无监督视频摘要的深度加强学习

获取原文

摘要

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decisionmaking process and develop a deep summarization network (DSN) to summarize videos. DSN predicts for each video frame a probability, which indicates how likely a frame is selected, and then takes actions based on the probability distributions to select frames, forming video summaries. To train our DSN, we propose an end-to-end, reinforcement learning-based framework, where we design a novel reward function that jointly accounts for diversity and representativeness of generated summaries and does not rely on labels or user interactions at all. During training, the reward function judges how diverse and representative the generated summaries are, while DSN strives for earning higher rewards by learning to produce more diverse and more representative summaries. Since labels are not required, our method can be fully unsupervised. Extensive experiments on two benchmark datasets show that our unsupervised method not only outperforms other state-of-the-art unsupervised methods, but also is comparable to or even superior than most of published supervised approaches.
机译:视频摘要旨在通过生产短,简洁的摘要来促进大规模的视频浏览,这些概要是不同的和代表原始视频的。在本文中,我们将视频摘要制定为连续决策过程,并开发一个深刻的摘要网络(DSN)来汇总视频。 DSN预测每个视频帧的概率,这表示选择了帧的程度,然后基于概率分布进行动作以选择帧,形成视频摘要。要培训我们的DSN,我们提出了一个端到端的加强学习的框架,在那里我们设计了一种新的奖励功能,共同考虑了所产生的摘要的多样性和代表性,并不依赖于标签或用户交互。在培训期间,奖励职能判断所产生的摘要的多样化和代表,而DSN努力通过学习产生更多样化和更具代表性的摘要来获得更高的奖励。由于不需要标签,我们的方法可以完全无监督。两个基准数据集的广泛实验表明,我们无监督的方法不仅优于其他最先进的无人监督的方法,而且比大多数发表的监督方法更优于甚至优于大多数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号