Pattern Recognition: The Journal of the Pattern Recognition Society

Learning visual and textual representations for multimodal matching and classification

Abstract

Multimodal learning, which aims to bridge the modality gap between heterogeneous representations such as vision and language, has been an important and challenging problem for decades. Unlike many current approaches, which focus only on either multimodal matching or classification, we propose a unified network that jointly learns multimodal matching and classification (MMC-Net) between images and texts. The proposed MMC-Net model seamlessly integrates the matching and classification components: it first learns visual and textual embedding features in the matching component, and then generates discriminative multimodal representations in the classification component. Combining the two components in a unified model helps improve the performance of both. Moreover, we present a multi-stage training algorithm that minimizes both the matching and classification loss functions. Experimental results on four well-known multimodal benchmarks demonstrate the effectiveness and efficiency of the proposed approach, which achieves competitive performance for multimodal matching and classification compared to state-of-the-art approaches. (C) 2018 Published by Elsevier Ltd.
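To make the described design concrete, below is a minimal PyTorch sketch of the two-component idea: a matching component that projects image and text features into a shared embedding space, and a classification component that classifies the fused multimodal representation, trained by minimizing the sum of a matching loss and a classification loss. The module names, feature dimensions, the bidirectional hinge ranking loss, and the two-stage schedule are all illustrative assumptions, not the authors' exact design.

```python
# Minimal sketch of joint matching + classification (assumptions: PyTorch,
# precomputed image/text features; the paper's exact components may differ).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MMCNetSketch(nn.Module):
    def __init__(self, img_dim=2048, txt_dim=300, embed_dim=512, num_classes=20):
        super().__init__()
        # Matching component: project each modality into a shared space.
        self.img_proj = nn.Linear(img_dim, embed_dim)
        self.txt_proj = nn.Linear(txt_dim, embed_dim)
        # Classification component: consumes the fused multimodal feature.
        self.classifier = nn.Linear(embed_dim * 2, num_classes)

    def forward(self, img_feat, txt_feat):
        v = F.normalize(self.img_proj(img_feat), dim=-1)  # visual embedding
        t = F.normalize(self.txt_proj(txt_feat), dim=-1)  # textual embedding
        logits = self.classifier(torch.cat([v, t], dim=-1))
        return v, t, logits

def matching_loss(v, t, margin=0.2):
    """Bidirectional hinge ranking loss over in-batch negatives, a common
    choice for image-text matching (an assumption here, not the paper's)."""
    sim = v @ t.t()                  # cosine similarities; embeddings are unit-norm
    pos = sim.diag().unsqueeze(1)    # matched pairs lie on the diagonal
    mask = torch.eye(sim.size(0), dtype=torch.bool, device=sim.device)
    cost_t = (margin + sim - pos).clamp(min=0).masked_fill(mask, 0)      # image -> text
    cost_v = (margin + sim - pos.t()).clamp(min=0).masked_fill(mask, 0)  # text -> image
    return cost_t.mean() + cost_v.mean()

model = MMCNetSketch()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
img = torch.randn(8, 2048)             # e.g. CNN image features
txt = torch.randn(8, 300)              # e.g. sentence embeddings
labels = torch.randint(0, 20, (8,))

# One plausible reading of the multi-stage schedule (an assumption):
# stage 1 optimizes the matching loss alone; stage 2 adds the
# classification loss and minimizes the joint objective.
for stage in (1, 2):
    opt.zero_grad()
    v, t, logits = model(img, txt)
    loss = matching_loss(v, t)
    if stage == 2:
        loss = loss + F.cross_entropy(logits, labels)
    loss.backward()
    opt.step()
```

Training the matching component first gives the classifier aligned embeddings to fuse, which is one way to read the claim that combining the two components in a unified model helps both tasks.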
