IEEE/CVF Conference on Computer Vision and Pattern Recognition

Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification



Abstract

While video-based person re-identification (Re-ID) has drawn increasing attention and made great progress in recent years, it is still very challenging to effectively overcome the occlusion problem and the visual ambiguity problem for visually similar negative samples. On the other hand, we observe that different frames of a video can provide complementary information for each other, and the structural information of pedestrians can provide extra discriminative cues for appearance features. Thus, modeling the temporal relations of different frames and the spatial relations within a frame has the potential for solving the above problems. In this work, we propose a novel Spatial-Temporal Graph Convolutional Network (STGCN) to solve these problems. The STGCN includes two GCN branches, a spatial one and a temporal one. The spatial branch extracts structural information of a human body. The temporal branch mines discriminative cues from adjacent frames. By jointly optimizing these branches, our model extracts robust spatial-temporal information that is complementary with appearance information. As shown in the experiments, our model achieves state-of-the-art results on MARS and DukeMTMC-VideoReID datasets.
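The abstract describes two GCN branches: a spatial one connecting body parts within a frame and a temporal one connecting the same part across adjacent frames. The following is a minimal, dependency-free sketch of that idea, not the authors' implementation; the graph layouts, the single-layer propagation rule `relu(A_hat @ X @ W)`, and the element-wise sum fusion are all illustrative assumptions.

```python
# Sketch (assumed, not the paper's code) of two-branch spatial-temporal
# graph convolution over per-frame body-part features.

def matmul(a, b):
    # Plain-Python matrix product.
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def normalize_adj(adj):
    # Row-normalize A + I (self-loops), a standard GCN propagation choice.
    n = len(adj)
    a_hat = [[adj[i][j] + (1 if i == j else 0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / deg[i] for j in range(n)] for i in range(n)]

def gcn_layer(adj_norm, feats, weight):
    # One propagation step: relu(A_hat @ X @ W).
    out = matmul(matmul(adj_norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in out]

def stgcn_sketch(feats, n_frames, n_parts, w_spatial, w_temporal):
    # Node i = part p of frame t, flattened as i = t * n_parts + p.
    n = n_frames * n_parts
    spatial = [[0] * n for _ in range(n)]
    temporal = [[0] * n for _ in range(n)]
    for t in range(n_frames):
        for p in range(n_parts):
            i = t * n_parts + p
            # Spatial branch: fully connect parts inside the same frame.
            for q in range(n_parts):
                if q != p:
                    spatial[i][t * n_parts + q] = 1
            # Temporal branch: link the same part in adjacent frames.
            if t + 1 < n_frames:
                j = (t + 1) * n_parts + p
                temporal[i][j] = temporal[j][i] = 1
    s_out = gcn_layer(normalize_adj(spatial), feats, w_spatial)
    t_out = gcn_layer(normalize_adj(temporal), feats, w_temporal)
    # Fuse branches by element-wise sum (a simple stand-in for the
    # paper's joint optimization of both branches).
    return [[s + t for s, t in zip(r1, r2)] for r1, r2 in zip(s_out, t_out)]

# Toy usage: 2 frames x 2 parts, 2-d features, identity weights.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
eye = [[1.0, 0.0], [0.0, 1.0]]
fused = stgcn_sketch(feats, n_frames=2, n_parts=2,
                     w_spatial=eye, w_temporal=eye)
```

In the full model each branch would stack several such layers and the node features would come from a CNN backbone; this sketch only shows how the two adjacency structures give the branches complementary receptive fields.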
