M-VAD names: a dataset for video captioning with naming

Pini Stefano; Cornia Marcella; Bolelli Federico; Baraldi Lorenzo; Cucchiara Rita

Abstract

Current movie captioning architectures cannot mention characters by their proper names and instead replace them with a generic "someone" tag. The lack of movie description datasets with visual annotations of characters plays a relevant role in this limitation. Recently, we proposed to extend the M-VAD dataset by introducing such information. In this paper, we present an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure. The resulting dataset contains 63k visual tracks and 34k textual mentions, all associated with character identities. To showcase the features of the dataset and quantify the complexity of the naming task, we investigate multimodal architectures that replace the "someone" tags with proper character names in existing video captions. The evaluation is further extended by testing this application on videos outside the M-VAD Names dataset.
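
To make the naming task concrete, the following is a minimal sketch of its final substitution step only: given a caption and an ordered list of predicted character names, each "someone" tag is replaced left to right. It is not the authors' multimodal architecture; the function name `fill_names`, the sample caption, and the character names are hypothetical, and the predictions are assumed to come from some upstream model.

```python
import re

# Matches the generic tag in either casing ("someone" or "SOMEONE").
SOMEONE = re.compile(r"\bsomeone\b", flags=re.IGNORECASE)


def fill_names(caption, names):
    """Replace each 'someone' tag, left to right, with the next predicted name.

    `names` is assumed to be ordered by mention; it is a stand-in for the
    output of a naming model, which this sketch does not implement.
    """
    remaining = iter(names)

    def substitute(match):
        # Keep the generic tag when no prediction is left for this mention.
        return next(remaining, match.group(0))

    return SOMEONE.sub(substitute, caption)


if __name__ == "__main__":
    # Hypothetical caption and names, for illustration only.
    caption = "SOMEONE hands the letter to SOMEONE and leaves."
    print(fill_names(caption, ["Harry", "Hermione"]))
```

Mentions without a prediction keep the generic tag, so a partially named caption stays grammatical.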

Bibliographic details

Source
Multimedia Tools and Applications, 2019, No. 10, pp. 14007-14027 (21 pages)
Authors
Pini Stefano; Cornia Marcella; Bolelli Federico; Baraldi Lorenzo; Cucchiara Rita
Affiliation (all five authors)
University of Modena and Reggio Emilia, Department of Engineering "Enzo Ferrari", Modena, Italy
Original format
PDF
Language
English
Keywords
Video captioning; Naming; Dataset; Deep learning

Similar literature

1. M-VAD names: a dataset for video captioning with naming [J]. Pini Stefano, Cornia Marcella, Bolelli Federico, Multimedia Tools and Applications. 2019, No. 10
2. Biomedical named entity recognition and linking datasets: survey and our recent development [J]. Ming-Siang Huang, Po-Ting Lai, Pei-Yen Lin, Briefings in Bioinformatics. 2020, No. 6
3. Interlinking SciGraph and DBpedia Datasets Using Link Discovery and Named Entity Recognition Techniques [J]. Beyza Yaman, Michele Pasin, Markus Freudenberg, OASIcs: OpenAccess Series in Informatics. 2019, No. 1
4. Towards Video Captioning with Naming: A Novel Dataset and a Multi-modal Approach [C]. Stefano Pini, Marcella Cornia, Lorenzo Baraldi, International Conference on Image Analysis and Processing. 2017
5. Distributed Dataset Synchronization in Named Data Networking [D]. Shang, Wentao. 2017
6. Liberating links between datasets using lightweight data publishing: an example using plant names and the taxonomic literature [O]. Roderic Page. 2018
7. M-VAD names: a dataset for video captioning with naming [O]. Stefano Pini, Marcella Cornia, Federico Bolelli. 2018
