首页> 外文期刊>電子情報通信学会技術研究報告. ITS. Intelligent Transport Systems Technology >Reinforcement Learning on Video Summarization with Hierarchical Structure
【24h】

Reinforcement Learning on Video Summarization with Hierarchical Structure

机译:钢筋综合与层次结构的概要学习

获取原文
获取原文并翻译 | 示例
           

摘要

Conventional video summarization approaches based on reinforcement learning have the problem that the reward can only be received after the whole summary is generated. Such kind of reward is sparse and it makes reinforcement learning hard to converge. Another problem is that labelling each shot is tedious and costly, which usually prohibits the construction of large-scale datasets. To solve these problems, we propose a weakly supervised hierarchical reinforcement learning framework, which decomposes the whole task into several subtasks to enhance the summarization quality. This framework consists of a manager network and a worker network. For each subtask, the manager is trained to set a subgoal only by a task-level binary label, which requires much fewer labels than conventional approaches. With the guide of the subgoal, the worker predicts the importance scores for video shots in the subtask by policy gradient according to both global reward and innovative defined sub-rewards to overcome the sparse problem. Experiments on two benchmark datasets show that our proposal has achieved the best performance, even better than supervised approaches.
机译:基于加强学习的传统视频摘要方法具有唯一在生成整个摘要之后才能接收奖励的问题。这种奖励是稀疏的,它使加强学习难以收敛。另一个问题是,每个镜头标记是繁琐且昂贵的,这通常禁止构建大规模数据集。为了解决这些问题,我们提出了一种弱监督的分层加强学习框架,它将整个任务分解为几个子特设,以提高摘要质量。该框架包括经理网络和工作网络。对于每个子任务,经理员工须培训,仅由任务级二进制标签设置子驻留,这需要比传统方法更少的标签。通过基站的指南,工作人员根据全球奖励和创新定义的子奖励,通过策略梯度预测子任务中视频射击的重要性分数来克服稀疏问题。两个基准数据集的实验表明,我们的提案取得了最佳性能,甚至比监督方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号