IEEE Conference on Computer Vision and Pattern Recognition Workshops

Deep Spatial-Temporal Fusion Network for Video-Based Person Re-identification

Abstract

In this paper, we propose a novel deep end-to-end network that automatically learns spatial-temporal fusion features for video-based person re-identification. Specifically, the proposed network combines a CNN and an RNN to jointly learn the spatial and the temporal features of input image sequences. The network is optimized with a siamese loss and a softmax loss simultaneously, pulling instances of the same person closer while pushing instances of different persons apart. Our network is trained on full-body and part-body image sequences respectively, so as to learn complementary representations from holistic and local perspectives. Combining them yields more discriminative features that benefit person re-identification. Experiments conducted on the PRID-2011, iLIDS-VID and MARS datasets show that the proposed method performs favorably against existing approaches.
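The pipeline the abstract describes — per-frame spatial features from a CNN, temporal aggregation by an RNN into a sequence-level representation, and a joint siamese-plus-softmax objective — can be sketched roughly as follows. This is a minimal NumPy sketch, not the paper's implementation: the linear-plus-ReLU "CNN", the vanilla RNN cell, all dimensions, and the random weights are hypothetical stand-ins for the actual learned architecture.

```python
import numpy as np

rng = np.random.default_rng(0)
D_IN, D_F, H, C, T = 32, 16, 8, 5, 6   # hypothetical frame/feature/class/sequence sizes

# Random stand-in weights; the real network learns these end to end.
W_cnn = rng.standard_normal((D_IN, D_F)) * 0.1
W_x   = rng.standard_normal((D_F, H)) * 0.1
W_h   = rng.standard_normal((H, H)) * 0.1
W_cls = rng.standard_normal((H, C)) * 0.1

def cnn_features(frames):
    """Stand-in per-frame 'CNN': a linear map plus ReLU."""
    return np.maximum(frames @ W_cnn, 0.0)

def rnn_aggregate(feats):
    """Vanilla RNN over the frame features; the final hidden state
    serves as the sequence-level spatial-temporal representation."""
    h = np.zeros(H)
    for x in feats:
        h = np.tanh(x @ W_x + h @ W_h)
    return h

def siamese_loss(fa, fb, same, margin=2.0):
    """Contrastive (siamese) loss: pull same-person sequences together,
    push different persons at least `margin` apart."""
    d = np.linalg.norm(fa - fb)
    return 0.5 * d**2 if same else 0.5 * max(0.0, margin - d)**2

def softmax_loss(feat, label):
    """Cross-entropy over identity classes."""
    logits = feat @ W_cls
    z = logits - logits.max()
    p = np.exp(z) / np.exp(z).sum()
    return -np.log(p[label])

# Two image sequences of the same (hypothetical) identity 3.
seq_a = rng.standard_normal((T, D_IN))
seq_b = rng.standard_normal((T, D_IN))
fa = rnn_aggregate(cnn_features(seq_a))
fb = rnn_aggregate(cnn_features(seq_b))

# Joint objective, as in the abstract: siamese + softmax losses.
total = siamese_loss(fa, fb, same=True) + softmax_loss(fa, 3) + softmax_loss(fb, 3)
```

In the paper this objective is applied to two such streams — one trained on full-body sequences and one on part-body sequences — whose representations are then combined at test time.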
