首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops >LoL-V2T: Large-Scale Esports Video Description Dataset
【24h】

LoL-V2T: Large-Scale Esports Video Description Dataset

机译:LOL-V2T:大规模的电子竞技视频描述数据集

获取原文

摘要

Esports is a fastest-growing new field with a largely online-presence, and is creating a demand for automatic domain-specific captioning tools. However, at the current time, there are few approaches that tackle the esports video description problem. In this work, we propose a large-scale dataset for esports video description, focusing on the popular game "League of Legends". The dataset, which we call LoL-V2T, is the largest video description dataset in the video game domain, and includes 9,723 clips with 62,677 captions. This new dataset presents multiple new video captioning challenges such as large amounts of domain-specific vocabulary, subtle motions with large importance, and a temporal gap between most captions and the events that occurred. In order to tackle the issue of vocabulary, we propose a masking the domain-specific words and provide additional annotations for this. In our results, we show that the dataset poses a challenge to existing video captioning approaches, and the masking can significantly improve performance. Our dataset and code is publicly available1.
机译:Esports是一个最快的新字段,主要是在线存在,并且正在为特定于自动域的标题工具创造一个需求。但是,在当前时,几乎没有解决蚀地点视频描述问题的方法。在这项工作中,我们提出了一个大规模的数据集,用于esports视频描述,重点关注流行的游戏“联盟”。我们调用lol-v2t的数据集是视频游戏域中最大的视频描述数据集,包括9,723个剪辑,其中包含62,677个标题。这个新数据集具有多个新的视频字幕挑战,例如大量的域特定词汇,具有很大的微妙动作,以及大多数标题之间的时间间隙和发生的事件。为了解决词汇问题,我们提出了一个掩蔽了域特定的单词并为此提供额外的注释。在我们的结果中,我们表明数据集对现有视频字幕方法构成挑战,并且屏蔽可以显着提高性能。我们的数据集和代码是公开可用的 1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号