Visual and Semantic Feature Coordinated Bi-LSTM Model for Unsupervised Video Summarization

Abstract

When dealing with user-created video, prior methods suffer from high redundancy among keyframes. To address this issue, we present a Visual and Semantic Feature coordinated Bi-LSTM (VSFB) model for unsupervised video summarization. First, a novel Salient-Area-Size-based spatial attention model is presented to extract frame-wise visual features, based on the observation that humans tend to focus on sizable and moving objects. Second, the visual features are integrated with semantic features processed by a Bi-LSTM to refine the frame-wise probability of being selected as a keyframe. Finally, an index-adjusted diversity and representativeness reward is used to reinforce the learning of the VSFB model. Extensive experiments demonstrate that our method outperforms state-of-the-art methods in terms of F-score.
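
The abstract does not define the index-adjusted reward, so the sketch below only illustrates the general idea of a diversity and representativeness reward as it is commonly formulated in reinforcement-learning-based summarization. It is not the VSFB reward itself. The function names, the toy feature vectors, and the choice to sum the two terms are assumptions made for illustration; feature vectors are assumed to be non-zero.

```python
import math


def cosine_dissimilarity(a, b):
    """Return 1 - cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def diversity_reward(features, selected):
    """Mean pairwise dissimilarity among the selected frames (hypothetical helper)."""
    n = len(selected)
    if n < 2:
        return 0.0  # diversity is undefined for fewer than two keyframes
    total = sum(
        cosine_dissimilarity(features[i], features[j])
        for i in selected
        for j in selected
        if i != j
    )
    return total / (n * (n - 1))


def representativeness_reward(features, selected):
    """exp(-mean distance from every frame to its nearest selected frame)."""
    if not selected:
        return 0.0
    total = sum(
        min(math.dist(frame, features[j]) for j in selected)
        for frame in features
    )
    return math.exp(-total / len(features))


# Toy frame features (assumed values): frames 0 and 1 are near-duplicates.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]

redundant = [0, 1]  # two nearly identical keyframes
diverse = [0, 2]    # two dissimilar keyframes

for name, picks in (("redundant", redundant), ("diverse", diverse)):
    reward = diversity_reward(features, picks) + representativeness_reward(features, picks)
    print(name, round(reward, 3))
```

In this toy case the selection that avoids near-duplicate frames scores higher on both terms, which is the behaviour a redundancy-reducing reward is meant to encourage.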

Bibliographic details

Source
IEEE International Conference on Multimedia and Expo | 2021 | pp. 1-6 | 6 pages
Authors
Zhiqiang Hong; Rui Zhong
Original format: PDF
Keywords
Visualization; Computational modeling; Conferences; Semantics; Redundancy; Reinforcement learning; Feature extraction

Similar literature

1. Biologically Inspired Model for Visual Cognition Achieving Unsupervised Episodic and Semantic Feature Learning [J]. Hong Qiao, Yinlin Li, Fengfu Li. IEEE Transactions on Cybernetics, 2016, No. 10.
2. Feature aggregation based visual attention model for video summarization [J]. Naveed Ejaz, Irfan Mehmood, Sung Wook Baik. Computers and Electrical Engineering, 2014, No. 3.
3. Endoscopy video summarization based on multi-modal descriptors and possibilistic unsupervised learning and feature subset weighting [J]. Mohamed Maher Ben Ismail, Ouiem Bchir, Ahmed Z. Emam. Intelligent Automation and Soft Computing, 2014, No. 3.
4. Unsupervised learning of visual and semantic features for video summarization [C]. Yansen Huang, Rui Zhong, Wenjin Yao. IEEE International Symposium on Circuits and Systems, 2021.
5. Optimization-based summarization and indexing of extended videos, with application to instructional video semantics [D]. Liu, Tiecheng. 2003.
6. Visual saliency models for summarization of diagnostic hysteroscopy videos in healthcare systems [O]. Khan Muhammad, Jamil Ahmad, Muhammad Sajjad.
7. Video summarization with visual and semantic features [O]. Dong P, Wang Z, Zhuo L. 2010.
