Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

机译：随着多样性 - 代表性奖励的无监督视频摘要的深度加强学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decisionmaking process and develop a deep summarization network (DSN) to summarize videos. DSN predicts for each video frame a probability, which indicates how likely a frame is selected, and then takes actions based on the probability distributions to select frames, forming video summaries. To train our DSN, we propose an end-to-end, reinforcement learning-based framework, where we design a novel reward function that jointly accounts for diversity and representativeness of generated summaries and does not rely on labels or user interactions at all. During training, the reward function judges how diverse and representative the generated summaries are, while DSN strives for earning higher rewards by learning to produce more diverse and more representative summaries. Since labels are not required, our method can be fully unsupervised. Extensive experiments on two benchmark datasets show that our unsupervised method not only outperforms other state-of-the-art unsupervised methods, but also is comparable to or even superior than most of published supervised approaches.

机译：视频摘要旨在通过生产短，简洁的摘要来促进大规模的视频浏览，这些概要是不同的和代表原始视频的。在本文中，我们将视频摘要制定为连续决策过程，并开发一个深刻的摘要网络（DSN）来汇总视频。 DSN预测每个视频帧的概率，这表示选择了帧的程度，然后基于概率分布进行动作以选择帧，形成视频摘要。要培训我们的DSN，我们提出了一个端到端的加强学习的框架，在那里我们设计了一种新的奖励功能，共同考虑了所产生的摘要的多样性和代表性，并不依赖于标签或用户交互。在培训期间，奖励职能判断所产生的摘要的多样化和代表，而DSN努力通过学习产生更多样化和更具代表性的摘要来获得更高的奖励。由于不需要标签，我们的方法可以完全无监督。两个基准数据集的广泛实验表明，我们无监督的方法不仅优于其他最先进的无人监督的方法，而且比大多数发表的监督方法更优于甚至优于大多数。

著录项

来源
《AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence》|2018年|6665-7655p|共8页
会议地点
作者
Kaiyang Zhou; Yu Qiao; Tao Xiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Using independently recurrent networks for reinforcement learning based unsupervised video summarization [J] . Yaliniz Gokhan, Ikizler-Cinbis Nazli Multimedia Tools and Applications . 2021,第12期

机译：基于无监督的视频概述的基于钢筋学习的独立反复网络
2. Crowd aware summarization of surveillance videos by deep reinforcement learning [J] . Junfeng Xu, Zhengxing Sun, Chen Ma Multimedia Tools and Applications . 2021,第4期

机译：通过深度加强学习，人群意识到监视视频的概述
3. Enhancement of Single Document Text Summarization using Reinforcement Learning with Non-Deterministic Rewards [J] . K.Karpagam, A. Saradha, K. Manikandan, International Journal of Information Technology and Computer Science . 2020,第4期

机译：利用非确定性奖励的强化学习提高单一文献摘要
4. Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward [C] . Kaiyang Zhou, Yu Qiao, Tao Xiang AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：随着多样性 - 代表性奖励的无监督视频摘要的深度加强学习
5. Deep Reinforcement Learning with Accelerated Reward Function Technique for Robotics Task Planning [D] . Shaikh, Shifa. 2021

机译：机器人任务规划加速奖励功能技术的深增强学习
6. Interp-SUM: Unsupervised Video Summarization with Piecewise Linear Interpolation [O] . Ui-Nyoung Yoon, Myung-Duk Hong, Geun-Sik Jo 2021

机译：Interp-Sum：具有分段线性插值的无监督视频摘要
7. Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization [O] . Siyao Li, Deren Lei, Pengda Qin, 2019

机译：具有分布语义奖励的深度加强学习，用于抽象摘要

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

摘要

著录项

相似文献

相关主题

期刊订阅