【24h】

Video Summarization Via Actionness Ranking

机译:通过动作等级进行视频汇总

获取原文

摘要

To automatically produce a brief yet expressive summary of a long video, an automatic algorithm should start by resembling the human process of summary generation. Prior work proposed supervised and unsupervised algorithms to train models for learning the underlying behavior of humans by increasing modeling complexity or craft-designing better heuristics to simulate human summary generation process. In this work, we take a different approach by analyzing a major cue that humans exploit for summary generation; the nature and intensity of actions. We empirically observed that a frame is more likely to be included in human-generated summaries if it contains a substantial amount of deliberate motion performed by an agent, which is referred to as actionness. Therefore, we hypothesize that learning to automatically generate summaries involves an implicit knowledge of actionness estimation and ranking. We validate our hypothesis by running a user study that explores the correlation between human-generated summaries and actionness ranks. We also run a consensus and behavioral analysis between human subjects to ensure reliable and consistent results. The analysis exhibits a considerable degree of agreement among subjects within obtained data and verifying our initial hypothesis. Based on the study findings, we develop a method to incorporate actionness data to explicitly regulate a learning algorithm that is trained for summary generation. We assess the performance of our approach on 4 summarization benchmark datasets, and demonstrate an evident advantage compared to state-of-the-art summarization methods.
机译:为了自动生成长视频的简短而富于表现力的摘要,一种自动算法应从类似于摘要生成的人工过程开始。先前的工作提出了有监督和无监督算法来训练模型,以通过增加建模复杂性或设计更好的启发式算法来模拟人类摘要生成过程,从而学习人类的基本行为。在这项工作中,我们通过分析人类利用摘要生成摘要的主要线索,采用了不同的方法。行动的性质和强度。我们凭经验观察到,如果框架包含由代理执行的大量故意运动,则该框架更有可能包含在人为生成的摘要中,这称为动作性。因此,我们假设学习自动生成摘要会涉及行动性估计和排名的隐性知识。我们通过进行一项用户研究来验证我们的假设,该研究探讨了人为产生的摘要与行动能力等级之间的相关性。我们还对人类受试者之间进行了共识和行为分析,以确保结果可靠且一致。该分析表明,在获得的数据中的受试者之间存在相当程度的一致性,并证实了我们的最初假设。基于研究结果,我们开发了一种合并动作数据的方法,以明确规范经过训练可生成摘要的学习算法。我们评估了我们的方法在4个汇总基准数据集上的性能,并证明了与最新汇总方法相比的明显优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号