首页> 外文会议>International Conference on Multimedia and Expo >A FUSION SCHEME OF VISUAL AND AUDITORY MODALITIES FOR EVENT DETECTION IN SPORTS VIDEO
【24h】

A FUSION SCHEME OF VISUAL AND AUDITORY MODALITIES FOR EVENT DETECTION IN SPORTS VIDEO

机译:运动视频事件检测的视觉和听觉模式的融合方案

获取原文
获取外文期刊封面目录资料

摘要

In this paper, we propose an effective fusion scheme of visual and auditory modalities to detect events in sports video. The proposed scheme is built upon semantic shot classification, where we classify video shots into several major or interesting classes, each of which has clear semantic meanings. Among major shot classes we perform classification of the different auditory signal segments (i.e. silence, hitting ball, applause, commentator speech) with the goal of detecting events with strong semantic meaning. For instance, for tennis video, we have identified five interesting events: serve, reserve, ace, return, and score. Since we have developed a unified framework for semantic shot classification in sports videos and a set of audio mid-level representation with supervised learning methods, the proposed fusion scheme can be easily adapted to a new sports game. We are extending this fusion scheme to three additional typical sports videos: basketball, volleyball and soccer. Correctly detected sports video events will greatly facilitate further, structural and temporal analysis, such as sports video skimming, table of content, etc.
机译:在本文中,我们提出了一种有效的视觉和听觉方式的融合方案,以检测体育视频的事件。该方案建立在语义拍摄分类之上,我们将视频射击分为几个主要或有趣的类,每个类别都具有明显的语义含义。在主要拍摄类中,我们执行不同听觉信号段的分类(即沉默,击球,掌声,评论员语音),其目标是检测具有强大语义含义的事件。例如,对于网球视频,我们已经确定了五个有趣的事件:服务,储备,ace,返回和得分。由于我们在体育视频中开发了一个统一的语义拍摄分类框架和一系列具有监督学习方法的音频中级表示,所提出的融合方案可以很容易地适应新的运动游戏。我们正在将这种融合方案扩展到三个额外的典型体育视频:篮球,排球和足球。正确检测到的体育视频事件将极大地促进进一步,结构和时间分析,如运动视频撇渣,内容表等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号