首页> 外文会议>Conference on empirical methods in natural language processing >Gaussian Visual-Linguistic Embedding for Zero-Shot Recognition
【24h】

Gaussian Visual-Linguistic Embedding for Zero-Shot Recognition

机译:高斯视觉语言嵌入零发芽识别

获取原文

摘要

An exciting outcome of research at the intersection of language and vision is that of zero-shot learning (ZSL). ZSL promises to scale visual recognition by borrowing distributed semantic models learned from linguistic corpora and turning them into visual recognition models. However the popular word-vector DSM embeddings are relatively impoverished in their expressivity as they model each word as a single vector point. In this paper we explore word-distribution embeddings for ZSL. We present a visual-linguistic mapping for ZSL in the case where words and visual categories are both represented by distributions. Experiments show improved results on ZSL benchmarks due to this better exploiting of intra-concept variability in each modality.
机译:在语言和视觉交汇处的一项令人兴奋的研究成果是零镜头学习(ZSL)。 ZSL承诺通过借用从语言语料库中学习到的分布式语义模型并将其转变为视觉识别模型来扩展视觉识别。但是,流行的词向量DSM嵌入在表达能力方面相对较差,因为它们将每个词建模为单个向量点。在本文中,我们探索了ZSL的词分布嵌入。在单词和视觉类别都由分布表示的情况下,我们为ZSL提供了视觉语言映射。实验表明,由于更好地利用了每种方式的概念内变异性,因此在ZSL基准上显示了改进的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号