A multimodal generative and fusion framework for recognizing faculty homepages

首页> 外文期刊>Information Sciences: An International Journal >A multimodal generative and fusion framework for recognizing faculty homepages

【24h】

A multimodal generative and fusion framework for recognizing faculty homepages

机译：用于识别教师主页的多模式生成和融合框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multimodal data consist of several data modes, where each mode is a group of similar data sharing the same attributes. Recognizing faculty homepages is essentially a multimodal classification problem in which a target faculty homepage is determined from three different information sources, including text, images, and layout. Conventional strategies in previous studies have been either to concatenate features from various information sources into a compound vector or to input them separately into several different classifiers that are then assembled into a stronger classifier for the final prediction. However, both approaches ignore the connections among different feature sets. We argue that such relations are essential to enhance multimodal classification. Besides, recognizing faculty homepages is a class imbalance problem in which the total number of samples of a minority class is far smaller than the sample numbers of other classes. In this study, we propose a multimodal generative and fusion framework for multimodal learning with the problems of imbalanced data and mutually dependent feature modes. Specifically, a multimodal generative adversarial network is first introduced to rebalance the dataset by generating pseudo features based on each mode and combining them to describe a fake sample. Then, a gated fusion network with the gate and fusion mechanisms is presented to reduce the noise to improve the generalization ability and capture the links among the different feature modes. Experiments on a faculty homepage dataset show the superiority of the proposed framework. (C) 2020 Published by Elsevier Inc.

机译：多模式数据由多种数据模式组成，其中每个模式是共享相同属性的类似数据组。识别教师主页基本上是多模式分类问题，其中目标教师主页是由三种不同的信息源决定，包括文本，图像和布局。先前研究中的常规策略已经将各种信息源的特征连接到复合载体中，或者将它们分别输入到几种不同的分类器中，然后将其组装成最终预测的更强分级器。然而，两种方法都忽略了不同特征集之间的连接。我们认为这种关系对于提高多式化分类至关重要。此外，识别教师主页是一个类别不平衡问题，其中少数群体的样本总数远小于其他类的样本。在这项研究中，我们提出了一种多模式生成和融合框架，用于多模式学习，具有不平衡数据和相互依赖的特征模式的问题。具体地，首先通过基于每种模式产生伪特征并将它们组合以描述虚假样本来引入多峰生成的对抗网络以重新平衡数据集。然后，提出了一种具有栅极和融合机制的门控融合网络以减少噪声以提高泛化能力并捕获不同特征模式中的链路。教师主页数据集的实验显示了所提出的框架的优越性。（c）由elsevier公司发布的2020年

著录项

来源
《Information Sciences: An International Journal》 |2020年第2020期|共16页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;计算机的应用;信息与知识传播;自动化技术、计算机技术;
关键词
Homepages; Multimodal generative adversarial network; Gated fusion network;

机译：主页;多模式生成对抗网络;门控融合网络;

相似文献

外文文献
中文文献
专利

1. A multimodal generative and fusion framework for recognizing faculty homepages [J] . Information Sciences: An International Journal . 2020,第期

机译：用于识别教师主页的多模式生成和融合框架
2. EmotionMeter: A Multimodal Framework for Recognizing Human Emotions [J] . Zheng Wei-Long, Liu Wei, Lu Yifei, Cybernetics, IEEE Transactions on . 2019,第3期

机译：EmotionMeter：用于识别人类情绪的多模式框架
3. SCNET: A Novel UGI Cancer Screening Framework Based on Semantic-Level Multimodal Data Fusion [J] . Shuai Ding, Hui Huang, Zhenmin Li, Biomedical and Health Informatics, IEEE Journal of . 2021,第1期

机译：SCNET：基于语义级多峰数据融合的新型UGI癌症筛查框架
4. A Generative Model for Probabilistic Label Fusion of Multimodal Data [C] . Juan Eugenio Iglesias, Mert Rory Sabuncu, Koen Van Leemput International Workshop on Multimodal Brain Image Analysis . 2012

机译：多式联数据概率标签融合的生成模型
5. Bayesian sensor fusion: A framework for using multimodal sensors to estimate target locations and identities in a battlefield scene. [D] . Smith, Michael Joseph. 2003

机译：贝叶斯传感器融合：一种使用多模式传感器估算战场场景中目标位置和身份的框架。
6. Segmentation of Gliomas in Pre-operative and Post-operative Multimodal Magnetic Resonance Imaging Volumes Based on a Hybrid Generative-Discriminative Framework [O] . Ke Zeng, Spyridon Bakas, Aristeidis Sotiras, -1

机译：基于混合生成-判别框架的术前和术后多模式磁共振成像中胶质瘤的分割
7. A family of methods for quality-based multimodal biometric fusion using generative classifiers [O] . Norman Poh, Josef Kittler 2013

机译：使用生成分类器进行基于质量的多峰生物特征融合的方法系列

A multimodal generative and fusion framework for recognizing faculty homepages

摘要

著录项

相似文献

相关主题

期刊订阅