Unsupervised Temporal Attention Summarization Model for User Created Videos

Abstract

Unlike surveillance videos, videos created by ordinary users contain more frequent shot changes, more diverse backgrounds, and a wider variety of content. Existing methods have two critical issues when summarizing user-created videos: 1) information distortion and 2) high redundancy among keyframes. Therefore, we propose a novel temporal attention model to evaluate the importance score of each frame. Specifically, building on the classical attention model, we combine the predictions of both the encoder and the decoder so that frame-level importance is scored using integrated information. Further, to sift out redundant frames, we devise a feedforward reward function that quantifies the diversity, representativeness, and storyness of candidate keyframes in the attention model. Finally, the Deep Deterministic Policy Gradient algorithm is adopted to efficiently solve the proposed formulation. Extensive experiments on the public SumMe and TVSum datasets show that our method outperforms the state of the art by a large margin in terms of F-score.
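
The abstract names diversity and representativeness as reward terms but does not give their formulas. As a rough illustration only, the sketch below uses definitions that are an assumption here and are not taken from the paper: diversity as the mean pairwise cosine dissimilarity among selected frames, and representativeness as the exponential of the negative mean squared distance from each frame to its nearest selected frame. The function names, the toy features, and the omission of the storyness term are all hypothetical choices for this example.

```python
import math

def cosine_dissimilarity(u, v):
    """Return 1 - cosine similarity of two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def diversity_reward(features, selected):
    """Mean pairwise dissimilarity among selected frames (assumed definition).

    Higher values mean the selected keyframes are less redundant.
    """
    n = len(selected)
    if n < 2:
        return 0.0
    total = sum(
        cosine_dissimilarity(features[i], features[j])
        for i in selected
        for j in selected
        if i != j
    )
    return total / (n * (n - 1))

def representativeness_reward(features, selected):
    """exp(-mean squared distance to the nearest selected frame) (assumed definition).

    Higher values mean every frame lies close to some selected keyframe.
    """
    if not selected:
        return 0.0

    def sq_dist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    mean_min = sum(
        min(sq_dist(f, features[k]) for k in selected) for f in features
    ) / len(features)
    return math.exp(-mean_min)

# Toy 2-D frame features: frames 0 and 1 look alike, as do frames 2 and 3.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
redundant = [0, 1]  # two near-duplicate keyframes
spread = [0, 2]     # one keyframe from each visual cluster

for name, picks in (("redundant", redundant), ("spread", spread)):
    print(name,
          round(diversity_reward(features, picks), 3),
          round(representativeness_reward(features, picks), 3))
```

On this toy input the spread selection scores higher on both terms than the near-duplicate selection, which is the behavior a redundancy-penalizing reward is meant to encourage. How the paper weights these terms, and how it defines storyness, is not stated in the abstract.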

Bibliographic details

Source
International Conference on Multimedia Modeling | 2021 | pp. 519-530 | 12 pages
Authors
Min Hu; Ruimin Hu; Xiaocheng Wang; Rui Sheng
Original format
PDF
Keywords
Video summarization; Unsupervised learning; Attention mechanism;

Similar literature

1. A generic framework of user attention model and its application in video summarization [J]. Yu-Fei Ma, Xian-Sheng Hua, Lie Lu. IEEE Transactions on Multimedia, 2005, No. 5.
2. Bayesian Modeling of Temporal Coherence in Videos for Entity Discovery and Summarization [J]. Adway Mitra, Soma Biswas, Chiranjib Bhattacharyya. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, No. 3.
3. User-Ranking Video Summarization With Multi-Stage Spatio–Temporal Representation [J]. Huang Siyu, Li Xi, Zhang Zhongfei. IEEE Transactions on Image Processing, 2019, No. 6.
4. Learning Temporal Co-Attention Models for Unsupervised Video Action Localization [C]. Guoqiang Gong, Xinghan Wang, Yadong Mu. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
5. Video Summarization Using Unsupervised Methods [D]. Bhosale, Akanksha. 2018.
6. Spatio-Temporal Attention Model for Foreground Detection in Cross-Scene Surveillance Videos [O]. Dong Liang, Jiaxing Pan, Han Sun. 2019.
7. A User Attention Model for Video Summarization [O]. Yu-Fei Ma, Lie Lu, Hong-Jiang Zhang. 2003.
