首页> 外文会议>AAAI Conference on Artificial Intelligence >Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image
【24h】

Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image

机译:学习异构词典对与单一查询图像的步行视频检索特征投影矩阵

获取原文

摘要

Person re-identification (re-id) plays an important role in video surveillance and forensics applications. In many cases, person re-id needs to be conducted between image and video clip, e.g., re-identifying a suspect from large quantities of pedestrian videos given a single image of him. We call re-id in this scenario as image to video person re-id (IVPR). In practice, image and video are usually represented with different features, and there usually exist large variations between frames within each video. These factors make matching between image and video become a very challenging task. In this paper, we propose a joint feature projection matrix and heterogeneous dictionary pair learning (PHDL) approach for IVPR. Specifically, PHDL jointly learns an intra-video projection matrix and a pair of heterogeneous image and video dictionaries. With the learned projection matrix, the influence of variations within each video to the matching can be reduced. With the learned dictionary pair, the heterogeneous image and video features can be transformed into coding coefficients with the same dimension, such that the matching can be conducted using coding coefficients. Furthermore, to ensure that the obtained coding coefficients have favorable discriminability, PHDL designs a point-to-set coefficient discriminant term. Experiments on the public iLIDS-VID and PRID 2011 datasets demonstrate the effectiveness of the proposed approach.
机译:人重新识别(RE-ID)在视频监控和取证应用中起着重要作用。在许多情况下,需要在图像和视频剪辑之间进行人员重新识别,例如,根据他的单一形象重新识别来自大量行人视频的嫌疑人。我们将此方案中的RE-ID称为视频人重新ID(IVPR)。在实践中,图像和视频通常用不同的特征表示,并且通常存在每个视频内的帧之间存在大的变化。这些因素使图像与视频之间的匹配成为一个非常具有挑战性的任务。在本文中,我们提出了一种针对IVPR的关节特征投影矩阵和异构词典对学习(PHDL)方法。具体而言,PHDL共同学习视频内投影矩阵和一对异构图像和视频词典。利用学习的投影矩阵,可以减少每个视频内的变化对匹配的影响。利用学习的字典对,可以将异构图像和视频特征变换为具有相同维度的编码系数,使得可以使用编码系数进行匹配。此外,为了确保所获得的编码系数具有良好的辨别性,PHDL设计了一种点对点系数判别项。公共ILIDS-VID和PRID 2011数据集的实验证明了拟议方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号