首页> 外文会议>IEEE International Conference and Workshops on Automatic Face and Gesture Recognition >Sports Videos in the Wild (SVW): A video dataset for sports analysis
【24h】

Sports Videos in the Wild (SVW): A video dataset for sports analysis

机译:野外运动视频(SVW):用于运动分析的视频数据集

获取原文

摘要

Considering the enormous creation rate of usergenerated videos on websites like YouTube, there is an immediate need for automatic categorization, recognition and analysis of videos. To develop algorithms for analyzing user-generated videos, unconstrained and representative datasets are of great significance. For this purpose, we collected a dataset of Sports Videos in the Wild (SVW), consisting of videos captured by users of the leading sports training mobile app (Coach's Eye) while practicing a sport or watching a game. The dataset contains 4100 videos selected by reviewing ~85,000 videos and consists of 30 sports categories and 44 actions. Videos of sports practice, which frequently happens outside the typical sports field, have huge intra-class variations due to background clutter, unrepresentative environment, existence of different training equipment and most importantly, imperfect actions. On the other hand, using smartphones for video capturing by ordinary people, in comparison to videos captured by professional crew for broadcasting, leads to challenges due to camera vibration and motion, occlusion, view point variation, and poor illumination. Given various manual labels, this dataset can be used for a wide range of computer vision applications, such as action recognition, action detection, genre categorization, and spatio-temporal alignment. On the sport genre categorization problem, we design the evaluation protocol and evaluate three different methods to provide baselines for future works.
机译:考虑到用户生成的视频在YouTube等网站上的巨大创造速度,迫切需要对视频进行自动分类,识别和分析。为了开发用于分析用户生成的视频的算法,无约束且具有代表性的数据集具有重要意义。为此,我们收集了野外运动视频(SVW)数据集,其中包括领先的运动训练移动应用程序(教练之眼)的用户在练习运动或观看比赛时捕获的视频。该数据集包含4100个视频,这些视频是通过查看约85,000个视频而选择的,并且包含30个运动类别和44个动作。运动练习的视频通常发生在典型的运动场之外,由于背景混乱,代表性不强的环境,存在不同的训练设备以及最重要的是动作不完善,导致类内差异很大。另一方面,与专业工作人员拍摄的视频相比,使用智能手机拍摄普通人的视频会导致由于相机振动和运动,遮挡,视点变化和照明不佳而带来的挑战。给定各种手动标签,此数据集可用于各种计算机视觉应用程序,例如动作识别,动作检测,体裁分类和时空对齐。在体育体裁分类问题上,我们设计了评估协议并评估了三种不同的方法,为将来的工作提供了基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号