Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

机译：概念性字幕：用于自动图像字幕的，干净的，上位的图像替代文本数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset (Lin et al., 2014) and represents a wider variety of both images and image caption styles. We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception-ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer (Vaswani et al., 2017) for sequence modeling achieves the best performance when trained on the Conceptual Captions dataset.

机译：我们提供了一个新的图像标题注释数据集，即概念标题，它比MS-COCO数据集包含更多数量级的图像（Lin等人，2014），并代表了更多的图像和图像标题样式。我们通过从数十亿个网页中提取和过滤图像标题注释来实现此目的。我们还提出了对许多图像字幕模型的定量评估，并显示了基于Inception-ResNet-v2（Szegedy等人，2016）的图像特征提取和Transformer（Vaswani等人，2017）的序列模型架构在“概念字幕”数据集上进行训练时，建模可获得最佳性能。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2018年|2556-2565|共10页
会议地点
作者
Piyush Sharma; Nan Ding; Sebastian Goodman; Radu Soricut;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Novel model to integrate word embeddings and syntactic trees for automatic caption generation from images [J] . Soft computing: A fusion of foundations, methodologies and applications . 2020,第2期

机译：从图像中集成Word Embeddings和Syntactic树的小说模型
2. Automatic Caption Generation for News Images [J] . Feng Yansong, Lapata Mirella Pattern Analysis and Machine Intelligence, IEEE Transactions on . 2013,第4期

机译：自动为新闻图像生成字幕
3. Automatic indexing and content-based retrieval of captioned images [J] . Srihari R.K. Computer . 1995,第9期

机译：自动索引和基于内容的字幕图像检索
4. Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning [C] . Piyush Sharma, Nan Ding, Sebastian Goodman, Annual meeting of the Association for Computational Linguistics . 2018

机译：概念标题：用于自动图像字幕的清洁，过度的图像ALT-TEXT DataSet
5. Image Captioning: A Survey of Existing Issues on Datasets, Evaluation Metrics and Methods [D] . zhou, liwan . 2020

机译：图像字幕：对数据集的现有问题，评估度量和方法的调查
6. Caption-based topical descriptors for microscopic images of breast neoplasms as published in academic papers [O] . Sujin Kim, Shannon Lamkin, Pam Duncan -1

机译：中发表的学术论文对乳腺肿瘤的显微图像基于带字幕的局部描述符
7. STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset [O] . Yoshikawa, Yuya, Shigeto, Yutaro, Takeuchi, Akikazu 2017

机译：sTaIR字幕：构建大型日文图像标题数据集

Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

摘要

著录项

相似文献

相关主题

期刊订阅