Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition

Weipeng Hu; Haifeng Hu

首页> 外文期刊>Multimedia, IEEE Transactions on >Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition

【24h】

Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition

机译：对抗脱位频谱变化和NIR-VIS面部识别的跨模型注意力网络

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Near-infrared and visual (NIR-VIS) matching task refers to the face recognition between the two images of different modalities, which remains a challenging task in the field of machine vision. The main problems of NIR-VIS Heterogeneous Face Recognition (HFR) tasks include two aspects: large intra-class differences caused by cross-modal data, and insufficient paired training samples. In this paper, an effective Adversarial Disentanglement spectrum variations and Cross-modality Attention Networks (ADCANs) is proposed for VIS-NIR matching task. Three key components are introduced to the ADCANs for reducing the gap of cross-modal images: Advanced Scatter Loss (ASL), Modality-adversarial Feature Learning (MaFL) and Cross-modality Attention Block (CmAB). The proposed ASL loss captures between- and within-class information of the data and embeds them to the network for more effective training, and it focuses on categories with small between-class distance and increases the distance between them. The MaFL consists of an Identity-Discriminative Feature Learning Network (IDFLN) and a Modality-Adversarial Disentanglement Network (MADN), which can enhance the identity-discriminative feature representations as well as disentangling spectrum variations via an adversarial learning. The IDFLN built by an end-to-end CNNs aims at learning identity-discriminative feature. While the MADN built by a discriminator

$D$

and a generator

$G$

focuses on removing modality-related information. Furthermore, to increase representation power as well as disentangling spectrum variations effectively, a CmAB block is introduced to the network, which sequentially applies spatial and channel attention modules to both the IDFLN and MADN. Since the channel attention module focuses on ‘what’ features to suppress or emphasize, an orthogonality constraint is introduced to the two channel attention modules, which allows MADN and IDFLN to focus on learning modality-related features and identity-related features, respectively. In particular, the ADCANs consists of multiple CmAB blocks to learn discriminative features and disentangle spectrum variations. A large number of experiments on three challenging HFR datasets indicate that the proposed ADCANs is effective for VIS-NIR HFR task.

机译：近红外和视觉（NIR-VI）匹配任务是指不同模式的两个图像之间的面部识别，这仍然是机器视野领域的具有挑战性的任务。 NIR-VI的异构面部识别（HFR）任务的主要问题包括两个方面：由跨模型数据引起的大型内部差异，并且配对训练样本不足。本文提出了一种有效的逆势脱位频谱变化和跨模型注意力网络（ADCans），用于Vis-Nir匹配任务。将三个关键组件引入到ADCANS中，用于降低跨模型图像的间隙：高级散射损失（ASL），模态 - 对抗特征学习（MAFL）和跨模型注意力块（CMAB）。所提出的ASL损失在数据的课堂内和课堂内信息之间捕获，并将它们嵌入到网络中以进行更有效的培训，并且它关注课程之间距离小的类别并增加它们之间的距离。 MAFL包括一个身份鉴别特征学习网络（IDFLN）和模态 - 对手解剖网络（MADN），其可以通过对抗学习来增强身份鉴别特征表示以及解解光谱变化。由端到端CNN构建的IDFLN旨在学习身份鉴别特征。虽然由鉴别者<内联公式XMLNS：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink “> $ d $ 和一个生成

$ g $ 侧重于删除与模块相关的信息。此外，为了有效地增加表示功率以及解解频谱变化，将CMAB块引入网络，这将空间和信道注意模块顺序地应用于IDFLN和MADN。由于通道注意力模块侧重于“抑制或强调的”功能，因此将正交的约束引入了两个通道注意模块，这允许Madn和IDFLN专注于学习模态相关的特征和身份相关的特征。特别地，ADCans由多个CMAB块组成，以学习鉴别特征和解散频谱变化。在三个挑战的HFR数据集上大量实验表明所提出的ADCans对Vis-Nir HFR任务有效。

著录项

来源
《Multimedia, IEEE Transactions on》 |2021年第1期|145-160|共16页
作者
Weipeng Hu; Haifeng Hu;
展开▼
作者单位

School of Electronics and Information Technology Sun Yat-sen University Guangzhou China;

School of Electronics and Information Technology Sun Yat-sen University Guangzhou China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Task analysis; Feature extraction; Face; Training; Face recognition; Visualization; Deep learning;

机译：任务分析;特征提取;面部;培训;人脸识别;可视化;深入学习;

相似文献

外文文献
中文文献
专利

1. Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition [J] . Weipeng Hu, Haifeng Hu IEEE transactions on information forensics and security . 2021,第期

机译：NIR-VI识别的双重对抗解剖和深度代表性解剖
2. Disentangled Spectrum Variations Networks for NIR–VIS Face Recognition [J] . Hu Weipeng, Hu Haifeng IEEE transactions on multimedia . 2020,第5期

机译：Disentangled频谱变化网络用于NIR-VIS面部识别
3. Spatial Attention Gated Variational Autoencoder Enhanced Cycle-Consistent Generative Adversarial Networks for MRI to CT Translation [J] . Kearney V., Zeimer B. P., Perry A., International Journal of Radiation Oncology, Biology, Physics . 2019,第1Suppla期

机译：空间关注改变变分性AutoEncoder增强的循环一致的生成对冲网络，用于MRI到CT翻译
4. Facial Expression Recognition: Disentangling Expression Based on Self-attention Conditional Generative Adversarial Nets [C] . Haohao Li, Qiong Liu, Xiaoming Wei, Chinese conference on pattern recognition and computer vision . 2019

机译：面部表情识别：基于自注意条件生成对抗网络的表情纠缠
5. American Sign Language Recognition using Adversarial Learning in a Multi-Frequency RF Sensor Network [D] . Macks, Trevor. 2020

机译：在多频RF传感器网络中使用对抗性学习的美国手语识别
6. MichiGAN: sampling from disentangled representations of single-cell data using generative adversarial networks [O] . Hengshi Yu, Joshua D. Welch 2021

机译：密歇根州：使用生成的对抗网络从单细胞数据的脱屑表示抽样
7. Spatial Attention Gated Variational Autoencoder Enhanced Cycle-Consistent Generative Adversarial Networks for MRI to CT Translation [O] . V. Kearney, B.P. Zeimer, A. Perry, 2019

机译：空间关注改变变分性AutoEncoder增强的循环一致的生成对冲网络，用于MRI到CT翻译

Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition

摘要

著录项

相似文献

相关主题

期刊订阅