首页> 外文期刊>Multimedia Tools and Applications >Person re-identification based on multi-level feature complementarity of cross-attention with part metric learning
【24h】

Person re-identification based on multi-level feature complementarity of cross-attention with part metric learning

机译:基于多级特征互补性的人重新识别与部件度量学习的跨关键

获取原文
获取原文并翻译 | 示例
           

摘要

Person re-identification is an image retrieval task, and its task is to perform a person matching in different cameras by a given person target. This research has been noticed and studied by more and more people. However, pose changes and occlusions often occur during a person walking. Especially in the most related methods, local features are not used to simply and effectively solve the problems of occlusion and pose changes. Moreover, the metric loss functions only consider the image-level case, and it cannot adjust the distance between local features well. To tackle the above problems, a novel person re-identification scheme is proposed. Through experiments, we found that we paid more attention to different parts of a person when we look at him from a horizontal or vertical perspective respectively. First, in order to solve the problem of occlusion and pose changes, we propose a Cross Attention Module (CAM). It enables the network to generate a cross attention map and improve the accuracy of person re-identification via the enhancement of the most significant local features of persons. The horizontal and vertical attention vectors of the feature maps are extracted and a cross attention map is generated, and the local key features are enhanced by this attention map. Second, in order to solve the problem of the lack of expression ability of the single-level feature maps, we propose a Multi-Level Feature Complementation Module (MLFCM). In this module, the missing information of high-level features is complemented by low-level features via short skip. Feature selection is also performed among deep features maps. The purpose of this module is to get the feature maps with complete information. Further, this module solves the problem of missing contour features in high-level semantic features. Third, in order to solve the problem that the current metric loss function cannot adjust the distance between local features, we propose Part Triple Loss Function (PTLF). It can reduce both within-class and increase between-class distance of the person parts. Experimental results show that our model achieves high values on Rank-k and mAP on Market-1501, Duke-MTMC and CUHK03-NP.
机译:人重新识别是一种图像检索任务,其任务是通过给定的人目标执行在不同摄像机中匹配的人。这项研究已经注意到并通过越来越多的人研究。然而,在一个人行走期间,姿势变化和闭塞通常会发生。特别是在最相关的方法中,局部特征不用于简单有效地解决闭塞和姿势变化的问题。此外,度量损失函数仅考虑图像级别案例,并且无法调整本地特征之间的距离。为了解决上述问题,提出了一种新的人重新识别方案。通过实验,我们发现,当我们分别从水平或垂直角度看他时,我们更加关注一个人的不同部分。首先,为了解决闭塞和姿势变化的问题,我们提出了一种跨关注模块(CAM)。它使网络能够通过增强人的最重要的当地特征来生成跨关注图并提高人员重新识别的准确性。提取特征映射的水平和垂直注意矢量,并产生跨关注图,并且通过该注意图提高了本地关键特征。其次,为了解决单层特征映射缺乏表达能力的问题,我们提出了一个多级别的特征互补模块(MLFCM)。在该模块中,高级功能的缺失信息通过短跳过而互补的低级功能。在深度特征映射中也执行特征选择。此模块的目的是使用完整信息获取特征映射。此外,该模块解决了高级语义特征中缺少轮廓特征的问题。第三,为了解决当前度量损失函数无法调整本地特征之间的距离的问题,我们提出了三重损耗功能(PTLF)。它可以减少课堂内部和人员零件的班级之间的增加。实验结果表明,我们的模型在市场-1501,Duke-MTMC和CUHK03-NP上实现了高秩-K和地图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号