首页> 外文会议>Conference on Multimedia Information Processing and Retrieval >Soccer Video Summarization Using Deep Learning
【24h】

Soccer Video Summarization Using Deep Learning

机译:使用深度学习的足球视频汇总

获取原文

摘要

This paper presents a deep learning approach to summarizing long soccer videos by leveraging the spatiotemporal learning capability of three-dimensional Convolutional Neural Network (3D-CNN) and Long Short-Term Memory (LSTM) - Recurrent Neural Network (RNN). Our proposed approach involves, 1) a step-by-step development of a Residual Network (ResNet) based 3D-CNN that recognizes soccer actions, 2) manually annotating 744 soccer clips from five soccer action classes for training, and 3) training an LSTM network on soccer features extracted by the proposed ResNet based 3D-CNN. We combine the 3D-CNN and LSTM models to detect soccer highlights. To summarize a soccer match video, we model the video input as a sequential concatenation of video segments whose inclusion in a summary video production is based on its validated relevance. To evaluate the proposed summarization system, 10 soccer videos were summarized and subsequently evaluated by 48 participants polled from 8 countries using the Mean Opinion Score (MOS) scale. Collectively, the summarized videos received a 4 of 5 MOS.
机译:本文提出了一种深度学习方法,通过利用三维卷积神经网络(3D-CNN)和长短期记忆(LSTM)-递归神经网络(RNN)的时空学习能力来总结长足球视频。我们提出的方法包括:1)逐步开发基于残差网络(ResNet)的3D-CNN,该3D-CNN可以识别足球动作; 2)手动注释来自五个足球动作类的744个足球剪辑以进行训练; 3)训练拟议的基于ResNet的3D-CNN提取的关于足球功能的LSTM网络。我们结合了3D-CNN和LSTM模型来检测足球亮点。为了总结足球比赛视频,我们将视频输入建模为视频片段的顺序串联,其视频摘要中包含的内容基于其经过验证的相关性。为了评估提议的摘要系统,对10个足球视频进行了总结,然后由来自8个国家的48位参与者使用平均意见评分(MOS)量表进行了评估。汇总的视频总共收到5个MOS中的4个。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号