首页> 外文期刊>Multimedia, IEEE Transactions on >Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition
【24h】

Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition

机译:对抗脱位频谱变化和NIR-VIS面部识别的跨模型注意力网络

获取原文
获取原文并翻译 | 示例
           

摘要

Near-infrared and visual (NIR-VIS) matching task refers to the face recognition between the two images of different modalities, which remains a challenging task in the field of machine vision. The main problems of NIR-VIS Heterogeneous Face Recognition (HFR) tasks include two aspects: large intra-class differences caused by cross-modal data, and insufficient paired training samples. In this paper, an effective Adversarial Disentanglement spectrum variations and Cross-modality Attention Networks (ADCANs) is proposed for VIS-NIR matching task. Three key components are introduced to the ADCANs for reducing the gap of cross-modal images: Advanced Scatter Loss (ASL), Modality-adversarial Feature Learning (MaFL) and Cross-modality Attention Block (CmAB). The proposed ASL loss captures between- and within-class information of the data and embeds them to the network for more effective training, and it focuses on categories with small between-class distance and increases the distance between them. The MaFL consists of an Identity-Discriminative Feature Learning Network (IDFLN) and a Modality-Adversarial Disentanglement Network (MADN), which can enhance the identity-discriminative feature representations as well as disentangling spectrum variations via an adversarial learning. The IDFLN built by an end-to-end CNNs aims at learning identity-discriminative feature. While the MADN built by a discriminator $D$ and a generator $G$ focuses on removing modality-related information. Furthermore, to increase representation power as well as disentangling spectrum variations effectively, a CmAB block is introduced to the network, which sequentially applies spatial and channel attention modules to both the IDFLN and MADN. Since the channel attention module focuses on ‘what’ features to suppress or emphasize, an orthogonality constraint is introduced to the two channel attention modules, which allows MADN and IDFLN to focus on learning modality-related features and identity-related features, respectively. In particular, the ADCANs consists of multiple CmAB blocks to learn discriminative features and disentangle spectrum variations. A large number of experiments on three challenging HFR datasets indicate that the proposed ADCANs is effective for VIS-NIR HFR task.
机译:近红外和视觉(NIR-VI)匹配任务是指不同模式的两个图像之间的面部识别,这仍然是机器视野领域的具有挑战性的任务。 NIR-VI的异构面部识别(HFR)任务的主要问题包括两个方面:由跨模型数据引起的大型内部差异,并且配对训练样本不足。本文提出了一种有效的逆势脱位频谱变化和跨模型注意力网络(ADCans),用于Vis-Nir匹配任务。将三个关键组件引入到ADCANS中,用于降低跨模型图像的间隙:高级散射损失(ASL),模态 - 对抗特征学习(MAFL)和跨模型注意力块(CMAB)。所提出的ASL损失在数据的课堂内和课堂内信息之间捕获,并将它们嵌入到网络中以进行更有效的培训,并且它关注课程之间距离小的类别并增加它们之间的距离。 MAFL包括一个身份鉴别特征学习网络(IDFLN)和模态 - 对手解剖网络(MADN),其可以通过对抗学习来增强身份鉴别特征表示以及解解光谱变化。由端到端CNN构建的IDFLN旨在学习身份鉴别特征。虽然由鉴别者<内联公式XMLNS:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink “> $ d $ 和一个生成 $ g $ 侧重于删除与模块相关的信息。此外,为了有效地增加表示功率以及解解频谱变化,将CMAB块引入网络,这将空间和信道注意模块顺序地应用于IDFLN和MADN。由于通道注意力模块侧重于“抑制或强调的”功能,因此将正交的约束引入了两个通道注意模块,这允许Madn和IDFLN专注于学习模态相关的特征和身份相关的特征。特别地,ADCans由多个CMAB块组成,以学习鉴别特征和解散频谱变化。在三个挑战的HFR数据集上大量实验表明所提出的ADCans对Vis-Nir HFR任务有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号