Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition

Weipeng Hu; Haifeng Hu

首页> 外文期刊>IEEE transactions on information forensics and security >Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition

【24h】

Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition

机译：NIR-VI识别的双重对抗解剖和深度代表性解剖

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The task of near-infrared and visual (NIR-VIS) face recognition refers to matching face data from different modalities, which has broad application prospects in areas such as multimedia information retrieval and criminal investigation. However, it remains a challenging task due to high intra-class variations and small-scale NIR-VIS dataset. In this paper, we propose a novel approach called Dual Adversarial Disentanglement and deep Representation Decorrelation (DADRD) to solve the NIR-VIS matching problem. In order to reduce the gap between NIR-VIS images, three key components are designed for DADRD model, including Cross-modal Margin (CmM) loss, Dual Adversarial Disentangled Variations (DADV) and Deep Representation Decorrelation (DRD). Firstly, the CmM loss captures within- and between-class information of the data, and it further reduces modality difference by a center-variation item. Secondly, the Mixed Facial Representation (MFR) layer of the backbone network is divided into three parts: the identity-related layer, the modality-related layer and the residual-related layer. The DADV is designed to reduce the intra-class variations, which consists of Adversarial Disentangled Modality Variations (ADMV) and Adversarial Disentangled Residual Variations (ADRV). Specifically, the ADMV and ADRV aim at eliminating spectrum variations and residual variations (i.e., lighting, pose, expression, occlusion, etc) respectively via an adversarial mechanism. Finally, we impose a DRD on the three decomposed features to make them irrelevant to each other, which can more effectively separate the three component information and enhance feature representations. In particular, we develop a Joint Three-stage Optimization (JTsO) strategy to effectively optimize the network. The joint formulation leads to the purification of identity information and the disentanglement of within-class variation information. Extensive experiments have been carried out on three challenging datasets, and the results demonstrate the effectiveness of our method.

机译：近红外和视觉（NIR-VI）的面部识别的任务是指来自不同模式的面部数据，在多媒体信息检索和刑事调查等领域具有广泛的应用前景。但是，由于高级内部变化和小型NIR-VI数据集，它仍然是一个具有挑战性的任务。在本文中，我们提出了一种新的方法，称为双重逆势解剖和深度代表性解剖（DADRD）来解决NIR-VIS匹配问题。为了减少NIR-VIS图像之间的差距，设计了三个关键组件用于DADRD模型，包括跨模型边缘（CMM）丢失，双对抗性解除态变量（DADV）和深度表示去译（DRD）。首先，CMM损失捕获数据的课程内和之间的级信息，并且它进一步降低了中心变化项的模态差。其次，骨干网的混合面部表示（MFR）层被分成三个部分：相同相关层，模态相关层和残余相关层。 DAPD旨在降低阶级内变异，由对抗性解剖模型变异（ADMV）和对抗性解除戒除的残留变异（ADRV）组成。具体地，ADMV和ADRV分别通过对抗机制消除光谱变化和残留变化（即，点亮，姿势，表达，闭塞等）。最后，我们对三个分解特征强加了DRD，使它们彼此无关紧要，这可以更有效地分离三个组件信息和增强特征表示。特别是，我们开发联合三阶段优化（JTSO）策略，以有效优化网络。联合配方导致身份信息的纯化和课堂内变异信息的解剖学。在三个挑战性数据集中进行了广泛的实验，结果证明了我们方法的有效性。

著录项

来源
《IEEE transactions on information forensics and security》 |2021年第2021期|70-85|共16页
作者
Weipeng Hu; Haifeng Hu;
展开▼
作者单位

School of Electronics and Information Technology Sun Yat-sen University Guangzhou China;

School of Electronics and Information Technology Sun Yat-sen University Guangzhou China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
NIR-VIS matching task; cross-modal margin loss; dual adversarial disentangled variations; deep representation decorrelation;

机译：NIR-VIS匹配任务;跨模型边缘损失;双重逆势解散变异;深度代表性解除络;

相似文献

外文文献
中文文献
专利

1. Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition [J] . Weipeng Hu, Haifeng Hu Multimedia, IEEE Transactions on . 2021,第1期

机译：对抗脱位频谱变化和NIR-VIS面部识别的跨模型注意力网络
2. Appearance-invariant place recognition by adversarially learning disentangled representation [J] . Qin Cao, Zhang Yunzhou, Liu Yan, Robotics and Autonomous Systems . 2020,第1期

机译：外观 - 不变地识别通过对抗外面的解散代表
3. D2SC-GAN: Dual Deep-Shallow Channeled Generative Adversarial Network, for Resolving Low-Resolution Faces for Recognition in Classroom Scenarios [J] . Avishek Bhattacharjee, Sukhendu Das IEEE Transactions on Biometrics, Behavior, and Identity Science . 2020,第3期

机译：D2SC-GaN：双层深浅通道的生成对策网络，用于解决课堂情景中识别的低分辨率面
4. Learning Invariant Deep Representation for NIR-VIS Face Recognition [C] . Ran He, Xiang Wu, Zhenan Sun, AAAI Conference on Artificial Intelligence . 2017

机译：学习不变性的NIR-VIS面部识别的深度表示
5. A Deeper Look at the Unsupervised Learning of Disentangled Representations in β-Vae from the Perspective of Core Object Recognition [D] . Sikka, Harshvardhan Digvijay. 2020

机译：从核心对象识别的角度来看，更深入地看看β-VAE中的解散表示的无情的学习
6. MichiGAN: sampling from disentangled representations of single-cell data using generative adversarial networks [O] . Hengshi Yu, Joshua D. Welch 2021

机译：密歇根州：使用生成的对抗网络从单细胞数据的脱屑表示抽样
7. Appearance-invariant place recognition by adversarially learning disentangled representation [O] . Cao Qin, Yunzhou Zhang, Yan Liu, 2020

机译：外观 - 不变地学习解除不印章代表的不变性地识别

Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅