...
首页> 外文期刊>IEEE transactions on information forensics and security >Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition
【24h】

Dual Adversarial Disentanglement and Deep Representation Decorrelation for NIR-VIS Face Recognition

机译:NIR-VI识别的双重对抗解剖和深度代表性解剖

获取原文
获取原文并翻译 | 示例
           

摘要

The task of near-infrared and visual (NIR-VIS) face recognition refers to matching face data from different modalities, which has broad application prospects in areas such as multimedia information retrieval and criminal investigation. However, it remains a challenging task due to high intra-class variations and small-scale NIR-VIS dataset. In this paper, we propose a novel approach called Dual Adversarial Disentanglement and deep Representation Decorrelation (DADRD) to solve the NIR-VIS matching problem. In order to reduce the gap between NIR-VIS images, three key components are designed for DADRD model, including Cross-modal Margin (CmM) loss, Dual Adversarial Disentangled Variations (DADV) and Deep Representation Decorrelation (DRD). Firstly, the CmM loss captures within- and between-class information of the data, and it further reduces modality difference by a center-variation item. Secondly, the Mixed Facial Representation (MFR) layer of the backbone network is divided into three parts: the identity-related layer, the modality-related layer and the residual-related layer. The DADV is designed to reduce the intra-class variations, which consists of Adversarial Disentangled Modality Variations (ADMV) and Adversarial Disentangled Residual Variations (ADRV). Specifically, the ADMV and ADRV aim at eliminating spectrum variations and residual variations (i.e., lighting, pose, expression, occlusion, etc) respectively via an adversarial mechanism. Finally, we impose a DRD on the three decomposed features to make them irrelevant to each other, which can more effectively separate the three component information and enhance feature representations. In particular, we develop a Joint Three-stage Optimization (JTsO) strategy to effectively optimize the network. The joint formulation leads to the purification of identity information and the disentanglement of within-class variation information. Extensive experiments have been carried out on three challenging datasets, and the results demonstrate the effectiveness of our method.
机译:近红外和视觉(NIR-VI)的面部识别的任务是指来自不同模式的面部数据,在多媒体信息检索和刑事调查等领域具有广泛的应用前景。但是,由于高级内部变化和小型NIR-VI数据集,它仍然是一个具有挑战性的任务。在本文中,我们提出了一种新的方法,称为双重逆势解剖和深度代表性解剖(DADRD)来解决NIR-VIS匹配问题。为了减少NIR-VIS图像之间的差距,设计了三个关键组件用于DADRD模型,包括跨模型边缘(CMM)丢失,双对抗性解除态变量(DADV)和深度表示去译(DRD)。首先,CMM损失捕获数据的课程内和之间的级信息,并且它进一步降低了中心变化项的模态差。其次,骨干网的混合面部表示(MFR)层被分成三个部分:相同相关层,模态相关层和残余相关层。 DAPD旨在降低阶级内变异,由对抗性解剖模型变异(ADMV)和对抗性解除戒除的残留变异(ADRV)组成。具体地,ADMV和ADRV分别通过对抗机制消除光谱变化和残留变化(即,点亮,姿势,表达,闭塞等)。最后,我们对三个分解特征强加了DRD,使它们彼此无关紧要,这可以更有效地分离三个组件信息和增强特征表示。特别是,我们开发联合三阶段优化(JTSO)策略,以有效优化网络。联合配方导致身份信息的纯化和课堂内变异信息的解剖学。在三个挑战性数据集中进行了广泛的实验,结果证明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号