...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Deep gated attention networks for large-scale street-level scene segmentation
【24h】

Deep gated attention networks for large-scale street-level scene segmentation

机译:大型街道级场景细分的深入门控注意网络

获取原文
获取原文并翻译 | 示例
           

摘要

Street-level scene segmentation aims to label each pixel of street-view images into specific semantic categories. It has been attracting growing interest due to various real-world applications, especially in the area of autonomous driving. However, this pixel-wise labeling task is very challenging under the complex street-level scenes and large-scale object categories. Motivated by the scene layout of street-view images, in this work we propose a novel Spatial Gated Attention (SGA) module, which automatically highlights the attentive regions for pixel-wise labeling, resulting in effective street-level scene segmentation. The proposed module takes as input the multi-scale feature maps based on a Fully Convolutional Network (FCN) backbone, and produces the corresponding attention mask for each feature map. The learned attention masks can neatly highlight the regions of interest while suppress background clutter. Furthermore, we propose an efficient multi-scale feature interaction mechanism which is able to adaptively aggregate the hierarchical features. Based on the proposed mechanism, the features of different levels are adaptively re-weighted according to the local spatial structure and the surrounding contextual information. Consequently, the proposed modules are able to boost standard FCN architectures and result in an enhanced pixel-wise segmentation for street-level scene images. Extensive experiments on three public available street-level benchmarks demonstrate that the proposed Gated Attention Network (GANet) approach achieves consistently superior performance and outperforms the very recent state-of-the-art methods. (C) 2018 Elsevier Ltd. All rights reserved.
机译:街道级场景分割旨在将街道视图图像的每个像素标记为特定的语义类别。由于各种现实世界应用,它一直吸引了日益增长的兴趣,特别是在自动驾驶领域。但是,在复杂的街道级场景和大规模对象类别下,这种像素明智的标签任务非常具有挑战性。通过街道视图图像的场景布局,在这项工作中,我们提出了一种新颖的空间门控注意力(SGA)模块,它自动突出显示映值的细节区域,从而产生有效的街道级场景分割。所提出的模块基于完全卷积网络(FCN)骨干,并为每个特征映射产生相应的关注掩模。所学到的注意面具可以整齐地突出抑制背景杂波的感兴趣区域。此外,我们提出了一种有效的多尺度特征交互机制,其能够自适应地聚合分层特征。基于所提出的机制,根据局部空间结构和周围的上下文信息,自适应地重新加权不同水平的特征。因此,所提出的模块能够促进标准FCN架构,并导致用于街道级场景图像的增强像素方向分割。在三个公共可用街道级基准测试中的广泛实验表明,拟议的门控注意网络(Ganet)方法始终卓越的性能和优于最近最先进的方法。 (c)2018年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号