...
首页> 外文期刊>Multimedia Tools and Applications >M-VAD names: a dataset for video captioning with naming
【24h】

M-VAD names: a dataset for video captioning with naming

机译:M-VAD名称:具有命名的视频字幕的数据集

获取原文
获取原文并翻译 | 示例
           

摘要

Current movie captioning architectures are not capable of mentioning characters with their proper name, replacing them with a generic someone tag. The lack of movie description datasets with characters' visual annotations surely plays a relevant role in this shortage. Recently, we proposed to extend the M-VAD dataset by introducing such information. In this paper, we present an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure. The resulting dataset contains 63 k visual tracks and 34 k textual mentions, all associated with character identities. To showcase the features of the dataset and quantify the complexity of the naming task, we investigate multimodal architectures to replace the someone tags with proper character names in existing video captions. The evaluation is further extended by testing this application on videos outside of the M-VAD Names dataset.
机译:当前的电影标题体系结构无法提及具有正确名称的字符,用泛型某人标记替换它们。缺少电影描述数据集具有字符的视觉注释肯定在此短缺中发挥着相关的作用。最近,我们建议通过介绍此类信息来扩展M-VAD数据集。在本文中,我们提出了一种改进的数据集版版本,即M-VAD名称及其半自动注释过程。生成的数据集包含63 k视觉曲目和34 k文本提及,所有与字符标识相关联。要展示数据集的功能并量化命名任务的复杂性,我们调查多峰架构以替换现有视频字幕中具有适当字符名称的某人标签。通过在M-VAD名称数据集之外的视频上测试此应用程序,进一步扩展了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号