首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Cross-modal transfer with neural word vectors for image feature learning
【24h】

Cross-modal transfer with neural word vectors for image feature learning

机译:带有神经词向量的跨模式传递用于图像特征学习

获取原文

摘要

Neural word vector (NWV) such as word2vec is a powerful text representation tool that can encode extensive semantic information into compact vectors. This ability poses an interesting question in relation to image processing research - Can we learn better semantic image features from NWVs? We empirically explore this question in the context of semantic content-based image retrieval (CBIR). In this paper, we consider cross-modal transfer learning (CMT) to improve initial convolutional neural network (CNN) image features by using NWVs. We first show that NWVs can improve semantic CBIR performance compared to classical word vectors, even if it is with simple CMT models, i.e., canonical correlation analysis (CCA). Next, inspired by a characteristic property of NWVs, we propose a new CMT model and demonstrate that it can improve CBIR performance even further.
机译:神经词向量(NWV),例如word2vec,是一种功能强大的文本表示工具,可以将大量的语义信息编码为紧凑的向量。这种能力在图像处理研究方面提出了一个有趣的问题-我们能否从NWV中学习更好的语义图像特征?我们在基于语义内容的图像检索(CBIR)的上下文中经验性地探讨了这个问题。在本文中,我们考虑通过使用NWV来改进跨卷积转移神经网络(CNN)图像特征的跨模态转移学习(CMT)。我们首先表明,与经典单词向量相比,NWV可以提高语义CBIR性能,即使使用简单的CMT模型(即规范相关分析(CCA))也是如此。接下来,受NWV的特性影响,我们提出了一个新的CMT模型,并证明了它可以进一步改善CBIR性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号