Deep gated attention networks for large-scale street-level scene segmentation

Zhang Pingping; Liu Wei; Wang Hongyu; Lei Yinjie; Lu Huchuan

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Deep gated attention networks for large-scale street-level scene segmentation

【24h】

Deep gated attention networks for large-scale street-level scene segmentation

机译：大型街道级场景细分的深入门控注意网络

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Street-level scene segmentation aims to label each pixel of street-view images into specific semantic categories. It has been attracting growing interest due to various real-world applications, especially in the area of autonomous driving. However, this pixel-wise labeling task is very challenging under the complex street-level scenes and large-scale object categories. Motivated by the scene layout of street-view images, in this work we propose a novel Spatial Gated Attention (SGA) module, which automatically highlights the attentive regions for pixel-wise labeling, resulting in effective street-level scene segmentation. The proposed module takes as input the multi-scale feature maps based on a Fully Convolutional Network (FCN) backbone, and produces the corresponding attention mask for each feature map. The learned attention masks can neatly highlight the regions of interest while suppress background clutter. Furthermore, we propose an efficient multi-scale feature interaction mechanism which is able to adaptively aggregate the hierarchical features. Based on the proposed mechanism, the features of different levels are adaptively re-weighted according to the local spatial structure and the surrounding contextual information. Consequently, the proposed modules are able to boost standard FCN architectures and result in an enhanced pixel-wise segmentation for street-level scene images. Extensive experiments on three public available street-level benchmarks demonstrate that the proposed Gated Attention Network (GANet) approach achieves consistently superior performance and outperforms the very recent state-of-the-art methods. (C) 2018 Elsevier Ltd. All rights reserved.

机译：街道级场景分割旨在将街道视图图像的每个像素标记为特定的语义类别。由于各种现实世界应用，它一直吸引了日益增长的兴趣，特别是在自动驾驶领域。但是，在复杂的街道级场景和大规模对象类别下，这种像素明智的标签任务非常具有挑战性。通过街道视图图像的场景布局，在这项工作中，我们提出了一种新颖的空间门控注意力（SGA）模块，它自动突出显示映值的细节区域，从而产生有效的街道级场景分割。所提出的模块基于完全卷积网络（FCN）骨干，并为每个特征映射产生相应的关注掩模。所学到的注意面具可以整齐地突出抑制背景杂波的感兴趣区域。此外，我们提出了一种有效的多尺度特征交互机制，其能够自适应地聚合分层特征。基于所提出的机制，根据局部空间结构和周围的上下文信息，自适应地重新加权不同水平的特征。因此，所提出的模块能够促进标准FCN架构，并导致用于街道级场景图像的增强像素方向分割。在三个公共可用街道级基准测试中的广泛实验表明，拟议的门控注意网络（Ganet）方法始终卓越的性能和优于最近最先进的方法。（c）2018年elestvier有限公司保留所有权利。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2019年第2019期|共13页
作者
Zhang Pingping; Liu Wei; Wang Hongyu; Lei Yinjie; Lu Huchuan;
展开▼
作者单位

Dalian Univ Technol Sch Informat &

Commun Engn Dalian 116024 Liaoning Peoples R China;

Shanghai Jiao Tong Univ Minist Educ Syst Control &

Informat Proc Key Lab Shanghai 200240 Peoples R China;

Dalian Univ Technol Sch Informat &

Commun Engn Dalian 116024 Liaoning Peoples R China;

Sichuan Univ Coll Elect &

Informat Engn Chengdu 610065 Sichuan Peoples R China;

Dalian Univ Technol Sch Informat &

Commun Engn Dalian 116024 Liaoning Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Scene segmentation; Fully convolutional network; Spatial gated attention; Street-level image understanding;

机译：场景分割;完全卷积网络;空间门腺关注;街道层面的图像理解;

相似文献

外文文献
中文文献
专利

1. Deep gated attention networks for large-scale street-level scene segmentation [J] . Zhang Pingping, Liu Wei, Wang Hongyu, Pattern Recognition: The Journal of the Pattern Recognition Society . 2019,第期

机译：大型街道级场景细分的深入门控注意网络
2. Automatic segmentation of gross target volume of nasopharynx cancer using ensemble of multiscale deep neural networks with spatial attention [J] . Mei Haochen, Lei Wenhui, Gu Ran, Neurocomputing . 2021,第MAYa28期

机译：用空间关注的多尺度深神经网络的集合自动分割鼻咽癌癌症
3. Anatomical Attention Guided Deep Networks for ROI Segmentation of Brain MR Images [J] . Sun Liang, Shao Wei, Zhang Daoqiang, IEEE Transactions on Medical Imaging . 2020,第6期

机译：解剖关注引导深度网络脑MR图像的ROI分割
4. Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes [C] . Siavash Gorji, James J. Clark IEEE Conference on Computer Vision and Pattern Recognition . 2017

机译：注意推送：在社交场景中使用共享注意模型来增强图像显着性的深度卷积网络
5. Video content extraction: Scene segmentation, linking and attention detection. [D] . Zhai, Yun. 2006

机译：视频内容提取：场景分割，链接和注意力检测。
6. Automatic Lung Segmentation on Chest X-rays Using Self-Attention Deep Neural Network [O] . Minki Kim, Byoung-Dai Lee 2021

机译：胸部X射线自动肺部使用自我关注深神经网络自动肺分割
7. Cars Can’t Fly Up in the Sky: Improving Urban-Scene Segmentation via Height-Driven Attention Networks [O] . Sungha Choi, Joanne T. Kim, Jaegul Choo 2020

机译：汽车无法在天空中飞行：通过高度驱动的注意网络改善城市场景细分

Deep gated attention networks for large-scale street-level scene segmentation

摘要

著录项

相似文献

相关主题

期刊订阅