首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
【24h】

Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

机译:概念性字幕:用于自动图像字幕的,干净的,上位的图像替代文本数据集

获取原文

摘要

We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset (Lin et al., 2014) and represents a wider variety of both images and image caption styles. We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception-ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer (Vaswani et al., 2017) for sequence modeling achieves the best performance when trained on the Conceptual Captions dataset.
机译:我们提供了一个新的图像标题注释数据集,即概念标题,它比MS-COCO数据集包含更多数量级的图像(Lin等人,2014),并代表了更多的图像和图像标题样式。我们通过从数十亿个网页中提取和过滤图像标题注释来实现此目的。我们还提出了对许多图像字幕模型的定量评估,并显示了基于Inception-ResNet-v2(Szegedy等人,2016)的图像特征提取和Transformer(Vaswani等人,2017)的序列模型架构在“概念字幕”数据集上进行训练时,建模可获得最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号