Video semantic segmentation via feature propagation with holistic attention

Abstract

Since the frames of a video are inherently contiguous, information redundancy is ubiquitous. Unlike previous works that densely process each frame of a video, this paper presents a novel method that focuses on efficient feature propagation across frames to tackle the challenging video semantic segmentation task. First, we propose a Light, Efficient and Real-time network (denoted as LERNet) as a strong backbone network for per-frame processing. Then we mine rich features within a key frame and propagate the cross-frame consistency information by calculating a temporal holistic attention with the following non-key frame. Each element of the attention matrix represents the global correlation between pixels of a non-key frame and the previous key frame. Concretely, we propose a new attention module to capture spatial consistency on low-level features along the temporal dimension. We then employ the attention weights as spatial transition guidance for directly generating the high-level features of the current non-key frame from the weighted corresponding key frame. Finally, we efficiently fuse the hierarchical features of the non-key frame and obtain the final segmentation result. Extensive experiments on two popular datasets, Cityscapes and CamVid, demonstrate that the proposed approach achieves a remarkable balance between inference speed and accuracy. © 2020 Elsevier Ltd. All rights reserved.
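The propagation step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not give the exact form of the attention, so the dot-product affinity, the row-wise softmax, and all function and variable names below are assumptions made for illustration. The LERNet backbone and the final hierarchical fusion are left out. Each frame is reduced to a short list of per-pixel feature vectors.

```python
import math


def softmax(scores):
    """Numerically stable softmax over one list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def holistic_attention(low_nonkey, low_key):
    """Return an N x M attention matrix.

    Entry [i][j] relates pixel i of the non-key frame to pixel j of the
    key frame, computed from low-level features. The dot-product affinity
    and the softmax normalisation are assumptions of this sketch.
    """
    attention = []
    for query in low_nonkey:
        scores = [sum(a * b for a, b in zip(query, key)) for key in low_key]
        attention.append(softmax(scores))
    return attention


def propagate(attention, high_key):
    """Build non-key high-level features as attention-weighted sums
    of the key frame's high-level features."""
    channels = len(high_key[0])
    return [
        [sum(w * feat[c] for w, feat in zip(weights, high_key))
         for c in range(channels)]
        for weights in attention
    ]


# Toy data: three pixels per frame, three low-level channels.
low_key = [[5.0, 0.0, 0.0], [0.0, 5.0, 0.0], [0.0, 0.0, 5.0]]
# In the non-key frame the first two pixels have swapped places,
# as if the scene content had moved between frames.
low_nonkey = [[0.0, 5.0, 0.0], [5.0, 0.0, 0.0], [0.0, 0.0, 5.0]]
# High-level features are computed for the key frame only (two channels).
high_key = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

attention = holistic_attention(low_nonkey, low_key)
high_nonkey = propagate(attention, high_key)

for row in high_nonkey:
    print([round(v, 3) for v in row])
```

In this toy case each non-key pixel attends almost entirely to the key-frame pixel with matching low-level features, so the high-level features follow the moved content without running the expensive high-level layers on the non-key frame.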


Bibliographic details

Source: Pattern Recognition: The Journal of the Pattern Recognition Society | 2020, issue 2020 | 11 pages
Original format: PDF
Language: eng
CLC classification: Computing technology, computer technology
Keywords: Real-time; Attention mechanism; Feature propagation; Video semantic segmentation

Similar literature

1. Video semantic segmentation via feature propagation with holistic attention [J]. Pattern Recognition: The Journal of the Pattern Recognition Society. 2020
2. Flow-guided feature propagation with occlusion aware detail enhancement for hand segmentation in egocentric videos [J]. Li Minglei, Sun Lei, Huo Qiang. Computer Vision and Image Understanding. 2019, issue Octa
3. Discriminative Feature Network Based on a Hierarchical Attention Mechanism for Semantic Hippocampus Segmentation [J]. Shi Jiali, Zhang Rong, Guo Lijun. IEEE Journal of Biomedical and Health Informatics. 2021, issue 2
4. Semantics Through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation [C]. Alina Marcu, Vlad Licaret, Dragos Costea. Asian Conference on Computer Vision. 2020
5. Video content extraction: Scene segmentation, linking and attention detection [D]. Zhai, Yun. 2006
6. Helping the Blind to Get through COVID-19: Social Distancing Assistant Using Real-Time Semantic Segmentation on RGB-D Video [O]. Manuel Martinez, Kailun Yang, Angela Constantinescu. 2020
7. Semantics Through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation [O]. Alina Marcu, Vlad Licaret, Dragos Costea. 2021
8. Mining Videos for Features that Drive Attention [R]. Baluch, F., Itti, L. 2015
