Large scale image annotation: learning to rank with joint word-image embeddings

Jason Weston; Samy Bengio; Nicolas Usunier

首页> 外文期刊>Machine Learning >Large scale image annotation: learning to rank with joint word-image embeddings

【24h】

Large scale image annotation: learning to rank with joint word-image embeddings

机译：大规模图像标注：学习使用联合词-图像嵌入进行排名

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image annotation datasets are becoming larger and larger, with tens of millions of images and tens of thousands of possible annotations. We propose a strongly performing method that scales to such datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations. Our method both outperforms several baseline methods and, in comparison to them, is faster and consumes less memory. We also demonstrate how our method learns an interpretable model, where annotations with alternate spellings or even languages are close in the embedding space. Hence, even when our model does not predict the exact annotation given by a human labeler, it often predicts similar annotations, a fact that we try to quantify by measuring the newly introduced "sibling" precision metric, where our method also obtains excellent results.

机译：图像注释数据集变得越来越大，具有数千万个图像和数万个可能的注释。我们提出了一种性能强大的方法，可通过同时学习优化给定图像的注释排名列表k的精度以及学习图像和注释的低维联合嵌入空间来扩展此类数据集的性能。我们的方法都优于几种基准方法，并且与之相比，速度更快且消耗的内存更少。我们还演示了我们的方法如何学习一种可解释的模型，其中具有替代拼写或什至是语言的注释在嵌入空间中很接近。因此，即使我们的模型无法预测人类标记者给出的确切注释，也常常会预测相似的注释，这是我们尝试通过测量新引入的“同级”精度度量进行量化的事实，我们的方法也获得了出色的结果。

著录项

来源
《Machine Learning》 |2010年第1期|p.21-35|共15页
作者
Jason Weston; Samy Bengio; Nicolas Usunier;
展开▼
作者单位

Google, New York, USA;

rnGoogle, Mountain View, USA;

rnUniversite Paris 6, LIP6, Paris, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
large scale; image annotation; learning to rank; embedding;

机译：大规模图像注释;学习排名;嵌入;

相似文献

外文文献
中文文献
专利

1. Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multiview Learning [J] . Tao Hong, Hou Chenping, Yi Dongyun, Cybernetics, IEEE Transactions on . 2021,第3期

机译：联合嵌入学习和低秩近似：不完整多视图学习的框架
2. Medical Image Annotation with a New Low-Rank Modeling-Based Multi-Label Active Learning Method [J] . Wu J., Ruan S., Lian C., Medical Physics . 2018,第6期

机译：用新的低级别建模的多标签活动学习方法的医学图像注释
3. Graph regularized low-rank feature mapping for multi-label learning with application to image annotation [J] . Feng Songhe, Lang Congyan Multidimensional systems and signal processing . 2018,第4期

机译：图表正常化的低秩特征映射，用于多标签学习，应用于图像注释
4. A Low-Rank Approximation Approach to Learning Joint Embeddings of News Stories and Images for Timeline Summarization [C] . William Yang Wang, Yashar Mehdad, Dragomir R. Radev, Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2016

机译：一种低秩近似方法，用于学习新闻故事和图像的联合嵌入以进行时间轴汇总
5. Image annotation and feature engineering via structural sparsity and low-rank approximation. [D] . Kong, Deguang. 2013

机译：通过结构稀疏性和低秩逼近进行图像注释和特征工程。
6. HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks [O] . Junning Gao, Lizhi Liu, Shuwei Yao, 2019

机译：HPOAnnotator：通过与HPO语义相似度和多个PPI网络的低秩近似改善HPO注释的大规模预测
7. A Low-Rank Approximation Approach to Learning Joint Embeddings of News Stories and Images for Timeline Summarization [O] . William Yang Wang, Yashar Mehdad, Dragomir R. Radev, 2016

机译：用于学习新闻报道的关节嵌入和图像的低级近似方法进行时间线概述

Large scale image annotation: learning to rank with joint word-image embeddings

摘要

著录项

相似文献

相关主题

期刊订阅