首页> 外文期刊>IEEE Transactions on Image Processing >User-Ranking Video Summarization With Multi-Stage Spatio–Temporal Representation
【24h】

User-Ranking Video Summarization With Multi-Stage Spatio–Temporal Representation

机译:用户排名视频摘要与多级时空表示

获取原文
获取原文并翻译 | 示例
       

摘要

Video summarization is a challenging task, mainly due to the difficulties in learning complicated semantic structural relations between videos and summaries. In this paper, we present a novel supervised video summarization scheme based on three-stage deep neural networks. The scheme takes a divide-andconquer strategy to resolve the complicated task of 3D video summarization into a set of easy and flexible computational subtasks, and then to sequentially perform 2D CNNs, 1D CNNs, and long short-term memory to address the subtasks in an hierarchical fashion. The hierarchical modeling of spatio-temporal structure leads to high performance and efficiency. In addition, we propose a simple but effective user-ranking method to cope with the labeling subjectivity problem of user-created video summarization, leading to the labeling quality refinement for robust supervised learning. Experimental results show that our approach outperforms the state-of-the-art video summarization methods on two benchmark datasets.
机译:视频摘要是一个具有挑战性的任务,主要是由于学习复杂的视频和摘要之间的语义结构关系的困难。在本文中,我们提出了一种基于三阶段深神经网络的新型监督视频摘要方案。该方案采用划分和疑问策略来解决3D视频摘要的复杂任务,进入一组简单且灵活的计算子任务,然后顺序地执行2D CNNS,1D CNN和长期短期内存以在an中寻址子任务等级时尚。时空结构的等级建模导致高性能和效率。此外,我们提出了一种简单但有效的用户排名方法来应对用户创造的视频摘要标记主观性问题,导致标签质量细化为强大的监督学习。实验结果表明,我们的方法在两个基准数据集中占此了现有的视频摘要方法。

著录项

  • 来源
    《IEEE Transactions on Image Processing》 |2019年第6期|2654-2664|共11页
  • 作者单位

    Zhejiang Univ Coll Informat Sci & Elect Engn Hangzhou 310027 Zhejiang Peoples R China;

    Zhejiang Univ Coll Comp Sci & Technol Hangzhou 310027 Zhejiang Peoples R China|Alibaba Zhejiang Univ Joint Inst Frontier Technol Hangzhou 310027 Zhejiang Peoples R China;

    Zhejiang Univ Coll Informat Sci & Elect Engn Hangzhou 310027 Zhejiang Peoples R China|SUNY Binghamton Watson Sch Comp Sci Dept Binghamton NY 13902 USA;

    Zhejiang Univ Coll Comp Sci & Technol Hangzhou 310027 Zhejiang Peoples R China;

    Northwestern Polytech Univ Sch Automat Xian 710072 Shaanxi Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Video summarization; recurrent neural network; convolutional neural network; multi-user inconsistency; user ranking;

    机译:视频摘要;经常性神经网络;卷积神经网络;多用户不一致;用户排名;
  • 入库时间 2022-08-18 20:56:06

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号