首页> 外文期刊>Neurocomputing >Multi-scale stacking attention pooling for remote sensing scene classification
【24h】

Multi-scale stacking attention pooling for remote sensing scene classification

机译:用于遥感场景分类的多尺度堆叠注意力汇总

获取原文
获取原文并翻译 | 示例
       

摘要

Remote sensing image scene classification is challenging due to the complicated spatial arrangement and varied object sizes inside a large-scale aerial image. Among the bottlenecks for current deep learning methods to depict and discriminate the complexity of remote sensing scenes, strengthening the local semantic representation and multi-scale feature representation is necessary. In this paper, we propose a multi-scale staking attention pooling (MS2AP) to tackle these challenges, which has three main contributions. Firstly, it can be conveniently embedded into current CNN models in an end-to-end manner to enhance the feature representation capability for remote sensing scenes. Secondly, we propose a novel residual channel-spatial attention module to mine the key local semantics in the feature maps. Compared with current attention modules, it can fuse top-down discriminative features and bottomup convolution features from both the channel and spatial domain. Thirdly, we propose a multi-scale dilated convolutional operator which can extract multi-scale feature maps and keep their sizes the same. In our MS2AP, these multi-scale feature maps are firstly staked and then down-sampled by a weighted pooling whose weight matrix comes from our attention module. Extensive experiments demonstrate that our MS2AP outperforms the baseline by 4.24% on UCM, 7.22% on AID and 14.12% on NWPU benchmark respectively, and substantially outperforms current state-of-the-art methods by a large margin.Remote sensing image scene classification is challenging due to the complicated spatial arrangement and varied object sizes inside a large-scale aerial image. Among the bottlenecks for current deep learning methods to depict and discriminate the complexity of remote sensing scenes, strengthening the local semantic representation and multi-scale feature representation is necessary. In this paper, we propose a multi-scale staking attention pooling (MS2AP) to tackle these challenges, which has three main contributions. Firstly, it can be conveniently embedded into current CNN models in an end-to-end manner to enhance the feature representation capability for remote sensing scenes. Secondly, we propose a novel residual channel-spatial attention module to mine the key local semantics in the feature maps. Compared with current attention modules, it can fuse top-down discriminative features and bottom up convolution features from both the channel and spatial domain. Thirdly, we propose a multi-scale dilated convolutional operator which can extract multi-scale feature maps and keep their sizes the same. In our MS2AP, these multi-scale feature maps are firstly staked and then down-sampled by a weighted pooling whose weight matrix comes from our attention module. Extensive experiments demonstrate that our MS2AP outperforms the baseline by 4.24% on UCM, 7.22% on AID and 14.12% on NWPU benchmark respectively, and substantially outperforms current state-of-the-art methods by a large margin.(c) 2021 Elsevier B.V. All rights reserved.
机译:遥感图像场景分类由于在大型空中图像内复杂的空间布置和不同的物体尺寸而挑战。在当前深度学习方法描绘和区分遥感场景的复杂性的瓶颈中,需要强化局部语义表示和多尺度特征表示。在本文中,我们提出了一种多规模的铆接注意力汇集(MS2AP)来解决这些挑战,其中有三个主要贡献。首先,可以以端到端的方式方便地嵌入到电流CNN模型中,以增强遥感场景的特征表示能力。其次,我们提出了一种新颖的剩余通道 - 空间注意力模块,用于在特征映射中挖掘关键的本地语义。与目前的注意力模块相比,它可以熔断来自频道和空间域的自上而下的辨别特征和自下而上的卷积功能。第三,我们提出了一种多尺度扩张的卷积运算符,可以提取多尺度特征映射并保持其尺寸。在我们的MS2AP中,首先将这些多尺度特征映射绘制,然后通过权重汇总来对我们的重量矩阵来自我们的注意模块进行下采样。广泛的实验表明,我们的MS2AP分别优于基线,在UCM的援助7.22%,分别对NWPU基准测试7.22%,并通过大型利润率显着优于最新的现有方法。更像感应图像场景分类由于复杂的空间排列和大型空中图像内的不同物体尺寸而挑战。在当前深度学习方法描绘和区分遥感场景的复杂性的瓶颈中,需要强化局部语义表示和多尺度特征表示。在本文中,我们提出了一种多规模的铆接注意力汇集(MS2AP)来解决这些挑战,其中有三个主要贡献。首先,可以以端到端的方式方便地嵌入到电流CNN模型中,以增强遥感场景的特征表示能力。其次,我们提出了一种新颖的剩余通道 - 空间注意力模块,用于在特征映射中挖掘关键的本地语义。与目前的注意力模块相比,它可以熔断来自频道和空间域的自上而下的辨别特征和自下而上的卷积功能。第三,我们提出了一种多尺度扩张的卷积运算符,可以提取多尺度特征映射并保持其尺寸。在我们的MS2AP中,首先将这些多尺度特征映射绘制,然后通过权重汇总来对我们的重量矩阵来自我们的注意模块进行下采样。广泛的实验表明,我们的MS2AP分别优于基线,效率为4.24%,7.22%,分别对NWPU基准测试分别为14.12%,并通过大型利润率大幅优于当前最先进的方法。(c)2021 Elsevier BV版权所有。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号