International Conference on Computer Vision

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching

Abstract

Image and sentence matching has drawn much attention recently, but due to the lack of sufficient pairwise training data, most previous methods still cannot reliably associate challenging pairs of images and sentences that contain rarely appearing regions and words, i.e., few-shot content. In this work, we study this challenging scenario as few-shot image and sentence matching, and accordingly propose an Aligned Cross-Modal Memory (ACMM) model to memorize the rarely appearing content. Given a pair of image and sentence, the model first employs an aligned memory controller network to produce two sets of semantically comparable interface vectors through cross-modal alignment. The interface vectors are then used by modality-specific read and update operations to alternately interact with shared memory items. The memory items persistently memorize cross-modal shared semantic representations, which can be addressed to enhance the representation of few-shot content. We apply the proposed model to both conventional and few-shot image and sentence matching tasks, and demonstrate its effectiveness by achieving state-of-the-art performance on two benchmark datasets.
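
The abstract outlines three interacting parts: an aligned controller that maps both modalities into comparable interface vectors, modality-specific read and update operations that address a shared memory, and memory items that persist across pairs. Below is a minimal PyTorch sketch of how such a module could be wired, assuming dot-product attention for both the cross-modal alignment and the memory addressing, and a sigmoid-gated write. All names, shapes, and the specific gating form are assumptions made for illustration; the abstract does not give the exact equations.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ACMMSketch(nn.Module):
    """Illustrative sketch of an aligned cross-modal memory.

    All shapes, names, and update rules here are assumptions; the
    abstract does not specify the exact formulation.
    """

    def __init__(self, dim: int = 512, num_items: int = 64):
        super().__init__()
        # Shared memory items that persist across image-sentence pairs.
        self.memory = nn.Parameter(0.02 * torch.randn(num_items, dim))
        # Aligned memory controller: projects each modality into a common
        # space so the two sets of interface vectors are comparable.
        self.img_ctrl = nn.Linear(dim, dim)
        self.txt_ctrl = nn.Linear(dim, dim)
        # Gates for the modality-specific memory updates (assumed form).
        self.img_gate = nn.Linear(2 * dim, dim)
        self.txt_gate = nn.Linear(2 * dim, dim)

    def _read(self, iface: torch.Tensor) -> torch.Tensor:
        # Address memory items by similarity, read a convex combination.
        attn = F.softmax(iface @ self.memory.t(), dim=-1)   # (n, K)
        return attn @ self.memory                           # (n, dim)

    def _update(self, iface: torch.Tensor, gate: nn.Linear) -> torch.Tensor:
        # Gated write: blend each memory item with the content that the
        # interface vectors route to it.
        attn = F.softmax(iface @ self.memory.t(), dim=-1)   # (n, K)
        write = attn.t() @ iface                            # (K, dim)
        g = torch.sigmoid(gate(torch.cat([self.memory, write], dim=-1)))
        return g * write + (1.0 - g) * self.memory

    def forward(self, regions: torch.Tensor, words: torch.Tensor):
        # Cross-modal alignment: attend words over regions and vice versa,
        # yielding semantically comparable interface vectors per modality.
        sim = regions @ words.t()                           # (R, W)
        img_iface = self.img_ctrl(regions + F.softmax(sim, dim=-1) @ words)
        txt_iface = self.txt_ctrl(words + F.softmax(sim.t(), dim=-1) @ regions)
        # Read: enhance each representation with memorized shared semantics.
        regions_enh = regions + self._read(img_iface)
        words_enh = words + self._read(txt_iface)
        # Alternate modality-specific updates of the shared memory
        # (simplified here to a detached, in-place persistence step).
        with torch.no_grad():
            self.memory.data = self._update(img_iface, self.img_gate)
            self.memory.data = self._update(txt_iface, self.txt_gate)
        return regions_enh, words_enh
```

Under these assumptions, calling `ACMMSketch()(torch.randn(36, 512), torch.randn(12, 512))` on a pair of 36 region features and 12 word features returns memory-enhanced region and word representations, which would then feed whatever matching objective the full model is trained with.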