首页> 外文会议>Asian Conference on Computer Vision >Aligning Salient Objects to Queries: A Multi-modal and Multi-object Image Retrieval Framework

【24h】

Aligning Salient Objects to Queries: A Multi-modal and Multi-object Image Retrieval Framework

机译：将突出对象对齐至查询：多模态和多对象图像检索框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose an approach for multi-modal image retrieval in multi-labelled images. A multi-modal deep network architecture is formulated to jointly model sketches and text as input query modalities into a common embedding space, which is then further aligned with the image feature space. Our architecture also relies on a salient object detection through a supervised LSTM-based visual attention model learned from convolutional features. Both the alignment between the queries and the image and the supervision of the attention on the images are obtained by generalizing the Hungarian Algorithm using different loss functions. This permits encoding the object-based features and its alignment with the query irrespective of the availability of the co-occurrence of different objects in the training set. We validate the performance of our approach on standard single/multi-object datasets, showing state-of-the art performance in every dataset.

机译：在本文中，我们提出了一种在多标记图像中的多模态图像检索方法。配制多模态深网络架构以共同模拟草图和文本作为输入查询模态进入公共嵌入空间，然后与图像特征空间进一步对齐。我们的体系结构还依赖于通过从卷积功能中学到的基于监督的基于LSTM的视觉注意力的对象检测。通过使用不同损失函数概括匈牙利算法来获得查询和图像之间的对准以及图像上的注意力。这允许编码基于对象的特征及其与查询对齐，而不管训练集中的不同对象的共同发生的可用性。我们验证了我们在标准单/多对象数据集中的方法的性能，在每个数据集中显示最先进的性能。

著录项

来源
《Asian Conference on Computer Vision》|2019年|715p|共15页
会议地点
作者
Sounak Dey; Anjan Dutta; Suman K. Ghosh; Ernest Valveny; Josep Llados; Umapada Pal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词

相似文献

外文文献
中文文献
专利

1. Deep Metric Learning for Multi-Label and Multi-Object Image Retrieval [J] . Jonathan MOJOO, Takio KURITA IEICE transactions on information and systems . 2021,第6期

机译：多标签和多对象图像检索的深度度量学习
2. Real-time Multi-object Face Recognition Using Content Based Image Retrieval (CBIR) [J] . Muhammad Fachrurrozi, Saparudin Saparudin, Erwin Erwin, International Journal of Electrical and Computer Engineering . 2018,第5期

机译：使用基于内容的图像检索（CBIR）进行实时多目标人脸识别
3. Experimental analysis of SIFT and SURF features for multi-object image retrieval [J] . H. Kavitha, M.V. Sudhamani International journal of computational vision and robotics . 2017,第3期

机译：多物体图像检索的筛选和冲浪特征的实验分析
4. Aligning Salient Objects to Queries: A Multi-modal and Multi-object Image Retrieval Framework [C] . Sounak Dey, Anjan Dutta, Suman K. Ghosh, Asian Conference on Computer Vision . 2019

机译：将突出对象对齐至查询：多模态和多对象图像检索框架
5. Multi-object shape retrieval using curvature trees [D] . Alajlan, Naif 2007

机译：使用曲率树的多对象形状检索
6. A Query Expansion Framework in Image Retrieval Domain Based on Local and Global Analysis [O] . M. M. Rahman, S. K. Antani, G. R. Thoma -1

机译：基于本地和全局分析的图像检索域中的查询扩展框架
7. Integrated Querying of Images by Color, Shape, and Texture Content of Salient Objects [O] . Ediz Saykol, Ugur Güdükbay, Özgür Ulusoy 2008

机译：通过显着对象的颜色，形状和纹理内容对图像进行综合查询

Aligning Salient Objects to Queries: A Multi-modal and Multi-object Image Retrieval Framework

摘要

著录项

相似文献

相关主题

期刊订阅