首页> 外文会议>European conference on computer vision >Video Summarization with Long Short-Term Memory
【24h】

Video Summarization with Long Short-Term Memory

机译:短期内记忆的视频摘要

获取原文

摘要

We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the task as a structured prediction problem, our main idea is to use Long Short-Term Memory (LSTM) to model the variable-range temporal dependency among video frames, so as to derive both representative and compact video summaries. The proposed model successfully accounts for the sequential structure crucial to generating meaningful video summaries, leading to state-of-the-art results on two benchmark datasets. In addition to advances in modeling techniques, we introduce a strategy to address the need for a large amount of annotated data for training complex learning approaches to summarization. There, our main idea is to exploit auxiliary annotated video summarization datasets, in spite of their heterogeneity in visual styles and contents. Specifically, we show that domain adaptation techniques can improve learning by reducing the discrepancies in the original datasets' statistical properties.
机译:我们提出了一种通过自动选择关键帧或密钥次拍摄来汇总视频的新型监督学习技术。将任务作为结构化预测问题铸造,我们的主要思想是使用长短期内存(LSTM)来模拟视频帧之间的可变时间依赖性,以导出代表性和紧凑的视频摘要。该建议的模型成功地占了发表对生成有意义的视频摘要至关重要的关键,导致两个基准数据集上的最先进的结果。除了建模技术的进步之外,我们还介绍了一种解决方案,以解决大量注释数据,以培训复杂的学习方法总结方法。在那里,我们的主要思想是利用辅助注释的视频摘要数据集,尽管它们在视觉样式和内容中的异质性。具体地,我们示出了域适应技术可以通过降低原始数据集统计特性的差异来改善学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号