
Bidirectional Retrieval Made Simple



Abstract

This paper presents a simple yet effective character-level architecture for learning bidirectional retrieval models. Aligning multimodal content is particularly challenging given the difficulty of finding semantic correspondences between images and descriptions. We introduce an efficient character-level inception module, designed to learn textual semantic embeddings by convolving raw characters at distinct granularity levels. Our approach explicitly encodes hierarchical information from distinct base-level representations (e.g., characters, words, and sentences) into a shared multimodal space, where it captures the semantic correspondence between images and descriptions via a contrastive pairwise loss function that minimizes order-violations. Models generated by our approach are far more robust to input noise than state-of-the-art strategies based on word embeddings. Despite being conceptually much simpler and requiring fewer parameters, our models outperform the state-of-the-art approaches by 4.8% in description retrieval and by 2.7% in image retrieval (absolute R@1 values) on the popular MS COCO retrieval dataset. We also show that our models deliver solid performance on text classification, especially in multilingual and noisy domains.
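The two central ideas of the abstract can be sketched in a few lines: parallel convolutions over raw characters with several kernel widths (the "inception-style" granularity levels), and an order-violation penalty that is zero only when one embedding dominates the other coordinate-wise. This is a toy illustration with randomly initialized weights, not the paper's trained model; the alphabet, dimensions, and kernel widths below are all hypothetical choices.

```python
import numpy as np

rng = np.random.default_rng(0)

ALPHABET = "abcdefghijklmnopqrstuvwxyz "  # toy character vocabulary (assumption)
EMB = 8                                   # per-character embedding size (assumption)
FILTERS = 4                               # filters per convolutional branch
KERNELS = (2, 3, 5)                       # granularity levels: character n-gram widths

char_emb = rng.normal(size=(len(ALPHABET), EMB))
# one weight bank per kernel width: inception-style parallel branches
weights = {k: rng.normal(size=(k * EMB, FILTERS)) for k in KERNELS}

def embed_text(text):
    """Map raw characters to a fixed-size vector via parallel 1-D convolutions."""
    idx = [ALPHABET.index(c) for c in text.lower() if c in ALPHABET]
    x = char_emb[idx]                                  # (T, EMB)
    branches = []
    for k, w in weights.items():
        # valid convolution with window width k, ReLU, then max-pool over time
        windows = np.stack([x[t:t + k].ravel() for t in range(len(idx) - k + 1)])
        branches.append(np.maximum(windows @ w, 0.0).max(axis=0))
    v = np.concatenate(branches)                       # (FILTERS * len(KERNELS),)
    return np.abs(v)   # keep embeddings in the non-negative orthant (order-embedding style)

def order_violation(a, b):
    """Penalty ||max(0, b - a)||^2: zero iff b <= a coordinate-wise."""
    return float((np.maximum(0.0, b - a) ** 2).sum())

caption = embed_text("a dog runs on the beach")
image = caption + 0.1          # a compatible "image" embedding (toy stand-in)
assert order_violation(image, caption) == 0.0  # correct order: no violation
assert order_violation(caption, image) > 0.0   # reversed order is penalized
```

A contrastive pairwise loss would then push `order_violation` toward zero for matching image–caption pairs while enforcing a margin against mismatched pairs sampled from the batch.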
