首页> 外文期刊>Machine Learning >Large scale image annotation: learning to rank with joint word-image embeddings
【24h】

Large scale image annotation: learning to rank with joint word-image embeddings

机译:大规模图像标注:学习使用联合词-图像嵌入进行排名

获取原文
获取原文并翻译 | 示例
           

摘要

Image annotation datasets are becoming larger and larger, with tens of millions of images and tens of thousands of possible annotations. We propose a strongly performing method that scales to such datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations. Our method both outperforms several baseline methods and, in comparison to them, is faster and consumes less memory. We also demonstrate how our method learns an interpretable model, where annotations with alternate spellings or even languages are close in the embedding space. Hence, even when our model does not predict the exact annotation given by a human labeler, it often predicts similar annotations, a fact that we try to quantify by measuring the newly introduced "sibling" precision metric, where our method also obtains excellent results.
机译:图像注释数据集变得越来越大,具有数千万个图像和数万个可能的注释。我们提出了一种性能强大的方法,可通过同时学习优化给定图像的注释排名列表k的精度以及学习图像和注释的低维联合嵌入空间来扩展此类数据集的性能。我们的方法都优于几种基准方法,并且与之相比,速度更快且消耗的内存更少。我们还演示了我们的方法如何学习一种可解释的模型,其中具有替代拼写或什至是语言的注释在嵌入空间中很接近。因此,即使我们的模型无法预测人类标记者给出的确切注释,也常常会预测相似的注释,这是我们尝试通过测量新引入的“同级”精度度量进行量化的事实,我们的方法也获得了出色的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号