首页> 外文会议>European conference on computer vision >Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections
【24h】

Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections

机译:使用大量带有弱注释的照片集来改善图像句子嵌入

获取原文
获取外文期刊封面目录资料

摘要

This paper studies the problem of associating images with descriptive sentences by embedding them in a common latent space. We are interested in learning such embeddings from hundreds of thousands or millions of examples. Unfortunately, it is prohibitively expensive to fully annotate this many training images with ground-truth sentences. Instead, we ask whether we can learn better image-sentence embeddings by augmenting small fully annotated training sets with millions of images that have weak and noisy annotations (titles, tags, or descriptions). After investigating several state-of-the-art scalable embedding methods, we introduce a new algorithm called Stacked Auxiliary Embedding that can successfully transfer knowledge from millions of weakly annotated images to improve the accuracy of retrieval-based image description.
机译:通过将图像与描述性句子嵌入到一个共同的潜在空间中,研究了它们与图像相关联的问题。我们有兴趣从数十万或数百万个示例中学习此类嵌入。不幸的是,用真实的句子来完全注释许多训练图像的费用实在太高了。取而代之的是,我们问我们是否可以通过使用数百万个带有弱且嘈杂的注释(标题,标签或描述)的图像来增强小的完全注释训练集来学习更好的图像句子嵌入。在研究了几种最先进的可伸缩嵌入方法之后,我们引入了一种称为堆叠辅助嵌入的新算法,该算法可以成功地从数百万个弱注释的图像中转移知识,从而提高基于检索的图像描述的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号