IEEE Transactions on Multimedia

Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification


Abstract

Video-based person re-identification (re-id) is a central application in surveillance systems, with significant security concerns. Matching persons across disjoint camera views in their video fragments is inherently challenging due to large visual variations and uncontrolled frame rates. Two steps are crucial to person re-id, namely, discriminative feature learning and metric learning. However, existing approaches consider the two steps independently, and they do not make full use of the temporal and spatial information in the videos. In this paper, we propose a Siamese attention architecture that jointly learns spatiotemporal video representations and their similarity metrics. The network extracts local convolutional features from regions of each frame and enhances their discriminative capability by focusing on distinct regions when measuring the similarity with another pedestrian video. The attention mechanism is embedded into spatial gated recurrent units to selectively propagate relevant features and memorize their spatial dependencies through the network. The model essentially learns which parts (where) from which frames (when) are relevant and distinctive for matching persons, and attaches higher importance to them. The proposed Siamese model is end-to-end trainable to jointly learn comparable hidden representations for paired pedestrian videos and their similarity value. Extensive experiments on three benchmark datasets show the effectiveness of each component of the proposed deep network while outperforming state-of-the-art methods.
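The core idea of the abstract — attention weights deciding which frames matter, and a Siamese branch with shared parameters producing a similarity score for a pair of videos — can be illustrated with a minimal NumPy sketch. This is not the paper's architecture (which uses convolutional features and spatial gated recurrent units); the scoring vector `w`, the feature dimensions, and the cosine-similarity matching score are all illustrative assumptions.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def attend_and_pool(frames, w):
    # frames: (T, D) per-frame features; w: (D,) hypothetical scoring vector.
    # Temporal attention decides *when* to look: frames with higher scores
    # contribute more to the pooled video-level representation.
    scores = frames @ w                # (T,) relevance score per frame
    alpha = softmax(scores)            # (T,) attention weights, sum to 1
    return alpha @ frames              # (D,) attention-pooled video feature

def siamese_similarity(seq_a, seq_b, w):
    # Both branches share the same parameters (w) — the Siamese property —
    # so the two videos are embedded into a comparable representation space.
    fa = attend_and_pool(seq_a, w)
    fb = attend_and_pool(seq_b, w)
    # Cosine similarity as an illustrative matching score.
    return float(fa @ fb / (np.linalg.norm(fa) * np.linalg.norm(fb) + 1e-8))

rng = np.random.default_rng(0)
w = rng.normal(size=64)                # stand-in for learned attention params
video_a = rng.normal(size=(10, 64))    # 10 frames, 64-dim features each
video_b = rng.normal(size=(12, 64))    # sequences may differ in length
sim = siamese_similarity(video_a, video_b, w)
print(f"similarity: {sim:.3f}")
```

In the full model, the attention additionally operates over spatial regions within each frame (the "where"), and the whole pipeline is trained end-to-end so that matched pairs score higher than mismatched ones.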
