首页> 外文期刊>Information Sciences: An International Journal >A multimodal generative and fusion framework for recognizing faculty homepages
【24h】

A multimodal generative and fusion framework for recognizing faculty homepages

机译:用于识别教师主页的多模式生成和融合框架

获取原文
获取原文并翻译 | 示例
           

摘要

Multimodal data consist of several data modes, where each mode is a group of similar data sharing the same attributes. Recognizing faculty homepages is essentially a multimodal classification problem in which a target faculty homepage is determined from three different information sources, including text, images, and layout. Conventional strategies in previous studies have been either to concatenate features from various information sources into a compound vector or to input them separately into several different classifiers that are then assembled into a stronger classifier for the final prediction. However, both approaches ignore the connections among different feature sets. We argue that such relations are essential to enhance multimodal classification. Besides, recognizing faculty homepages is a class imbalance problem in which the total number of samples of a minority class is far smaller than the sample numbers of other classes. In this study, we propose a multimodal generative and fusion framework for multimodal learning with the problems of imbalanced data and mutually dependent feature modes. Specifically, a multimodal generative adversarial network is first introduced to rebalance the dataset by generating pseudo features based on each mode and combining them to describe a fake sample. Then, a gated fusion network with the gate and fusion mechanisms is presented to reduce the noise to improve the generalization ability and capture the links among the different feature modes. Experiments on a faculty homepage dataset show the superiority of the proposed framework. (C) 2020 Published by Elsevier Inc.
机译:多模式数据由多种数据模式组成,其中每个模式是共享相同属性的类似数据组。识别教师主页基本上是多模式分类问题,其中目标教师主页是由三种不同的信息源决定,包括文本,图像和布局。先前研究中的常规策略已经将各种信息源的特征连接到复合载体中,或者将它们分别输入到几种不同的分类器中,然后将其组装成最终预测的更强分级器。然而,两种方法都忽略了不同特征集之间的连接。我们认为这种关系对于提高多式化分类至关重要。此外,识别教师主页是一个类别不平衡问题,其中少数群体的样本总数远小于其他类的样本。在这项研究中,我们提出了一种多模式生成和融合框架,用于多模式学习,具有不平衡数据和相互依赖的特征模式的问题。具体地,首先通过基于每种模式产生伪特征并将它们组合以描述虚假样本来引入多峰生成的对抗网络以重新平衡数据集。然后,提出了一种具有栅极和融合机制的门控融合网络以减少噪声以提高泛化能力并捕获不同特征模式中的链路。教师主页数据集的实验显示了所提出的框架的优越性。 (c)由elsevier公司发布的2020年

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号