【24h】

Deep Learning System for Image Retrieval

机译:用于图像检索的深度学习系统

获取原文
获取外文期刊封面目录资料

摘要

In the modern era of digital photography and advent of smartphones, millions of images are generated every day and they represent precious moments and events of our lives. As we continue to add images to our digital storehouse, the management and access handling of the images becomes a daunting task and we lose track unless properly managed. We are in essential need of a tool that can fetch images based on a word or a description. In this paper, we try to build a solution that retrieves relevant images from a pool, based on the description by looking at the content of the image. The model is based on deep neural network architecture and attending to relevant parts of the image. The algorithm takes a sentence or word as input and obtains the top images which are relevant to the caption. We obtain the representation of the sentence and image in a higher dimension, which enables us to compare the two and find the similarity level of both to decide on the relevance. We have conducted various experiments to improve the representation of the image and the caption obtained in the latent space for better correlation, for e.g. use of bidirectional sequence models for better textual representation, use of various baseline convolution-based stacks for better image representation. We have also tried to incorporate the attention mechanism to focus on only the relevant parts of the image and the sentence, thereby enhancing the correlation between the two spaces.
机译:在数码摄影和智能手机问世的现代时代,每天生成数百万张图像,它们代表着我们生活中的宝贵时刻和事件。随着我们继续将图像添加到我们的数字仓库中,图像的管理和访问处理成为一项艰巨的任务,除非进行适当的管理,否则我们将失去跟踪。我们迫切需要一种可以基于单词或描述来获取图像的工具。在本文中,我们尝试构建一个解决方案,根据描述通过查看图像内容从池中检索相关图像。该模型基于深度神经网络架构,并涉及图像的相关部分。该算法将句子或单词作为输入,并获得与字幕相关的顶部图像。我们以较高的维度获得了句子和图像的表示形式,这使我们能够比较两者并找到两者的相似度来决定相关性。我们进行了各种实验,以改善图像的表示方式和在潜在空间中获得的字幕,以实现更好的相关性,例如使用双向序列模型可以更好地显示文字,可以使用各种基于基线卷积的堆栈来更好地显示图像。我们还尝试结合注意力机制,将注意力仅集中在图像和句子的相关部分,从而增强两个空间之间的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号