首页> 外文会议>European Conference on Computer Vision >Learning to Learn from Web Data Through Deep Semantic Embeddings
【24h】

Learning to Learn from Web Data Through Deep Semantic Embeddings

机译:学习通过Deep Semantic Embeddings从Web数据中学习

获取原文

摘要

In this paper we propose to learn a multimodal image and text embedding from Web and Social Media data, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval. We demonstrate that the pipeline can learn from images with associated text without supervision and perform a thorough analysis of five different text embeddings in three different benchmarks. We show that the embeddings learnt with Web and Social Media data have competitive performances over supervised methods in the text based image retrieval task, and we clearly outperform state of the art in the MIRFlickr dataset when training in the target data. Further we demonstrate how semantic multimodal image retrieval can be performed using the learnt embeddings, going beyond classical instance-level retrieval problems. Finally, we present a new dataset, InstaCitieslM, composed by Instagram images and their associated texts that can be used for fair comparison of image-text embeddings.
机译:在本文中,我们建议学习从网络和社交媒体数据嵌入的多模式图像和文本,旨在利用文本域中学习的语义知识并将其转移到用于语义图像检索的可视模型。我们展示了管道可以在没有监督的情况下从带有相关文本的图像中学习,并在三个不同的基准中对五种不同的文本嵌入进行彻底分析。我们展示使用Web和社交媒体数据学习的eMbeddings在基于文本的图像检索任务中具有竞争性表现,并且在目标数据中训练时,我们在Mirflickr数据集中清楚地倾斜了最优异的最新状态。此外,我们演示了如何使用学习的嵌入式执行语义多模式图像检索,超出古典实例级检索问题。最后,我们提出了一个新的数据集,Instacitieslm,由Instagram映像和它们的关联文本组成,可用于图像文本嵌入的公平比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号