首页> 外文会议>IEEE International Conference on Multimedia and Expo >Visual and Semantic Feature Coordinated Bi-Lstm Model for Unsupervised Video Summarization
【24h】

Visual and Semantic Feature Coordinated Bi-Lstm Model for Unsupervised Video Summarization

机译:视觉和语义特征协调为无监督视频摘要的双LSTM模型

获取原文

摘要

While dealing with user-created video, the prior methods suffer from the problem of high redundancy among keyframes. To address the critical issue, we present a Visual and Semantic Feature coordinated Bi-LSTM (VSFB) model for unsupervised video summarization. First, a novel Salient-Area-Size-based spatial attention model is presented to extract frame-wise visual features on the observation that humans tend to focus on sizable and moving objects. Second, the visual features are integrated with semantic features processed by Bi-LSTM to refine the frame-wise probability of being selected as keyframes. Finally, an index adjusted diversity and representativeness reward is utilized to reinforce the learning operation of the VSFB model in the video summarization. Extensive experiments demonstrate that our method outperforms state-of-the-art methods in terms of the F-score.
机译:在处理用户创建的视频时,先前的方法遭受关键帧中的高冗余问题。 为了解决关键问题,我们为无监督视频摘要提供了一种视觉和语义特征协调的BI-LSTM(VSFB)模型。 首先,提出了一种新的突出区域尺寸的空间注意模型,以提取框架 - 方面的视觉特征,即人类倾向于聚焦在倍大和移动物体上。 其次,视觉功能与由Bi-LSTM处理的语义特征集成,以优化被选为关键帧的帧亮概率。 最后,利用索引调整的分集和代表性奖励来加强视频摘要中VSFB模型的学习操作。 广泛的实验表明,我们的方法在F分数方面优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号