Multi-scale stacking attention pooling for remote sensing scene classification

Bi Qi; Zhang Han; Qin Kun

首页> 外文期刊>Neurocomputing >Multi-scale stacking attention pooling for remote sensing scene classification

【24h】

Multi-scale stacking attention pooling for remote sensing scene classification

机译：用于遥感场景分类的多尺度堆叠注意力汇总

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Remote sensing image scene classification is challenging due to the complicated spatial arrangement and varied object sizes inside a large-scale aerial image. Among the bottlenecks for current deep learning methods to depict and discriminate the complexity of remote sensing scenes, strengthening the local semantic representation and multi-scale feature representation is necessary. In this paper, we propose a multi-scale staking attention pooling (MS2AP) to tackle these challenges, which has three main contributions. Firstly, it can be conveniently embedded into current CNN models in an end-to-end manner to enhance the feature representation capability for remote sensing scenes. Secondly, we propose a novel residual channel-spatial attention module to mine the key local semantics in the feature maps. Compared with current attention modules, it can fuse top-down discriminative features and bottomup convolution features from both the channel and spatial domain. Thirdly, we propose a multi-scale dilated convolutional operator which can extract multi-scale feature maps and keep their sizes the same. In our MS2AP, these multi-scale feature maps are firstly staked and then down-sampled by a weighted pooling whose weight matrix comes from our attention module. Extensive experiments demonstrate that our MS2AP outperforms the baseline by 4.24% on UCM, 7.22% on AID and 14.12% on NWPU benchmark respectively, and substantially outperforms current state-of-the-art methods by a large margin.Remote sensing image scene classification is challenging due to the complicated spatial arrangement and varied object sizes inside a large-scale aerial image. Among the bottlenecks for current deep learning methods to depict and discriminate the complexity of remote sensing scenes, strengthening the local semantic representation and multi-scale feature representation is necessary. In this paper, we propose a multi-scale staking attention pooling (MS2AP) to tackle these challenges, which has three main contributions. Firstly, it can be conveniently embedded into current CNN models in an end-to-end manner to enhance the feature representation capability for remote sensing scenes. Secondly, we propose a novel residual channel-spatial attention module to mine the key local semantics in the feature maps. Compared with current attention modules, it can fuse top-down discriminative features and bottom up convolution features from both the channel and spatial domain. Thirdly, we propose a multi-scale dilated convolutional operator which can extract multi-scale feature maps and keep their sizes the same. In our MS2AP, these multi-scale feature maps are firstly staked and then down-sampled by a weighted pooling whose weight matrix comes from our attention module. Extensive experiments demonstrate that our MS2AP outperforms the baseline by 4.24% on UCM, 7.22% on AID and 14.12% on NWPU benchmark respectively, and substantially outperforms current state-of-the-art methods by a large margin.(c) 2021 Elsevier B.V. All rights reserved.

机译：遥感图像场景分类由于在大型空中图像内复杂的空间布置和不同的物体尺寸而挑战。在当前深度学习方法描绘和区分遥感场景的复杂性的瓶颈中，需要强化局部语义表示和多尺度特征表示。在本文中，我们提出了一种多规模的铆接注意力汇集（MS2AP）来解决这些挑战，其中有三个主要贡献。首先，可以以端到端的方式方便地嵌入到电流CNN模型中，以增强遥感场景的特征表示能力。其次，我们提出了一种新颖的剩余通道 - 空间注意力模块，用于在特征映射中挖掘关键的本地语义。与目前的注意力模块相比，它可以熔断来自频道和空间域的自上而下的辨别特征和自下而上的卷积功能。第三，我们提出了一种多尺度扩张的卷积运算符，可以提取多尺度特征映射并保持其尺寸。在我们的MS2AP中，首先将这些多尺度特征映射绘制，然后通过权重汇总来对我们的重量矩阵来自我们的注意模块进行下采样。广泛的实验表明，我们的MS2AP分别优于基线，在UCM的援助7.22％，分别对NWPU基准测试7.22％，并通过大型利润率显着优于最新的现有方法。更像感应图像场景分类由于复杂的空间排列和大型空中图像内的不同物体尺寸而挑战。在当前深度学习方法描绘和区分遥感场景的复杂性的瓶颈中，需要强化局部语义表示和多尺度特征表示。在本文中，我们提出了一种多规模的铆接注意力汇集（MS2AP）来解决这些挑战，其中有三个主要贡献。首先，可以以端到端的方式方便地嵌入到电流CNN模型中，以增强遥感场景的特征表示能力。其次，我们提出了一种新颖的剩余通道 - 空间注意力模块，用于在特征映射中挖掘关键的本地语义。与目前的注意力模块相比，它可以熔断来自频道和空间域的自上而下的辨别特征和自下而上的卷积功能。第三，我们提出了一种多尺度扩张的卷积运算符，可以提取多尺度特征映射并保持其尺寸。在我们的MS2AP中，首先将这些多尺度特征映射绘制，然后通过权重汇总来对我们的重量矩阵来自我们的注意模块进行下采样。广泛的实验表明，我们的MS2AP分别优于基线，效率为4.24％，7.22％，分别对NWPU基准测试分别为14.12％，并通过大型利润率大幅优于当前最先进的方法。（c）2021 Elsevier BV版权所有。

著录项

来源
《Neurocomputing》 |2021年第14期|147-161|共15页
作者
Bi Qi; Zhang Han; Qin Kun;
展开▼
作者单位

Wuhan Univ Sch Remote Sensing & Informat Engn Wuhan Peoples R China;

Wuhan Univ Sch Remote Sensing & Informat Engn Wuhan Peoples R China;

Wuhan Univ Sch Remote Sensing & Informat Engn Wuhan Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Scene classification; Attention pooling; Residual attention module; Multi-scale feature representation; Feature fusion; Convolutional neural network;

机译：场景分类;注意汇总;剩余注意模块;多尺度特征表示;特征融合;卷积神经网络;
入库时间 2022-08-19 02:09:56

相似文献

外文文献
中文文献
专利

1. Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling [J] . Nanjun He, Leyuan Fang, Shutao Li, IEEE Transactions on Geoscience and Remote Sensing. . 2018,第12期

机译：多层堆叠协方差池的遥感场景分类
2. Multi-scale attentive region adaptive aggregation learning for remote sensing scene classification [J] . Lv Guangrui, Dong Lili, Zhang Wenwen, International journal of remote sensing . 2021,第19a20期

机译：遥感场景分类的多尺度细分区域自适应聚合学习
3. Scene classification of remote sensing image based on deep network and multi-scale features fusion [J] . Yang Zhou, Mu Xiao-dong, Zhao Feng-an Optik: Zeitschrift fur Licht- und Elektronenoptik: = Journal for Light-and Electronoptic . 2018,第期

机译：基于深网络和多尺度特征融合的遥感图像场景分类
4. MLRS-CNN-DWTPL: A New Enhanced Multi-Label Remote Sensing Scene Classification Using Deep Neural Networks with Wavelet Pooling Layers [C] . Said E. El-Khamy, Ahmad Al-Kabbany, Shimaa EL-Bana International Telecommunications Conference . 2021

机译：MLRS-CNN-DWTPL：使用具有小波池层的深神经网络的新增强的多标签遥感场景分类
5. Using New and Long-Term Multi-Scale Remotely Sensed Data to Detect Recurrent Fires and Quantify Their Relationship to Land Cover/Use in Indonesian Peatlands =Analisis kebakaran berulang dan hubungannya dengan perubahan tutupan/penggunaan lahan di lahan ga [D] . Vetrita, Yenni. 2021

机译：使用新的和长期多级远程感测数据来检测反复火灾，并量化其与印度尼西亚泥炭地区的土地覆盖/用途的关系=反复消防分析及其与覆盖/土地使用变化的关系
6. Fuzzy Classification of High Resolution Remote Sensing Scenes Using Visual Attention Features [O] . Linyi Li, Tingbao Xu, Yun Chen 2017

机译：视觉注意特征对高分辨率遥感场景的模糊分类
7. Retraction: Zhu R. et al. Attention-Based Deep Feature Fusion for the Scene Classification of High-Resolution Remote Sensing Images. Remote Sensing. 2019, 11(17), 1996 [O] . Ruixi Zhu, Li Yan, Nan Mo, 2020

机译：撤回：朱R.等。基于关注的高分辨率遥感图像场景分类的深度特征融合。遥感。 2019,11（17），1996年

Multi-scale stacking attention pooling for remote sensing scene classification

摘要

著录项

相似文献

相关主题

期刊订阅