首页> 外文OA文献 >Semantic Labeling in Remote Sensing Corpora Using Feature Fusion-Based Enhanced Global Convolutional Network with High-Resolution Representations and Depthwise Atrous Convolution
【2h】

Semantic Labeling in Remote Sensing Corpora Using Feature Fusion-Based Enhanced Global Convolutional Network with High-Resolution Representations and Depthwise Atrous Convolution

机译:使用具有高分辨率表示和深度不足的全局卷积网络的特征融合增强的全球卷积网络,遥感语料库中的语义标记

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

One of the fundamental tasks in remote sensing is the semantic segmentation on the aerial and satellite images. It plays a vital role in applications, such as agriculture planning, map updates, route optimization, and navigation. The state-of-the-art model is the Enhanced Global Convolutional Network (GCN152-TL-A) from our previous work. It composes two main components: (i) the backbone network to extract features and ( i i ) the segmentation network to annotate labels. However, the accuracy can be further improved, since the deep learning network is not designed for recovering low-level features (e.g., river, low vegetation). In this paper, we aim to improve the semantic segmentation network in three aspects, designed explicitly for the remotely sensed domain. First, we propose to employ a modern backbone network called “High-Resolution Representation (HR)” to extract features with higher quality. It repeatedly fuses the representations generated by the high-to-low subnetworks with the restoration of the low-resolution representations to the same depth and level. Second, “Feature Fusion (FF)” is added to our network to capture low-level features (e.g., lines, dots, or gradient orientation). It fuses between the features from the backbone and the segmentation models, which helps to prevent the loss of these low-level features. Finally, ”Depthwise Atrous Convolution (DA)” is introduced to refine the extracted features by using four multi-resolution layers in collaboration with a dilated convolution strategy. The experiment was conducted on three data sets: two private corpora from Landsat-8 satellite and one public benchmark from the “ISPRS Vaihingen” challenge. There are two baseline models: the Deep Encoder-Decoder Network (DCED) and our previous model. The results show that the proposed model significantly outperforms all baselines. It is the winner in all data sets and exceeds more than 90% of F 1 : 0.9114, 0.9362, and 0.9111 in two Landsat-8 and ISPRS Vaihingen data sets, respectively. Furthermore, it achieves an accuracy beyond 90% on almost all classes.
机译:遥感中的一个基本任务是空中和卫星图像上的语义分割。它在应用程序中起着至关重要的作用,例如农业规划,地图更新,路由优化和导航。最先进的模型是我们以前的工作中增强的全球卷积网络(GCN152-TL-A)。它构成了两个主要组件:(i)骨干网络提取特征和(i)分段网络以注释标签。然而,可以进一步提高精度,因为深度学习网络不是设计用于恢复低级特征(例如,河流,低植被)。在本文中,我们的目标是在三个方面改进语义分割网络,明确设计为远程感测域。首先,我们建议使用称为“高分辨率表示(HR)”的现代骨干网络,以提取具有更高质量的特征。它反复融合由高低子网生成的表示,以恢复低分辨率表示到相同的深度和级别。其次,“特征融合(FF)”被添加到我们的网络中以捕获低级功能(例如,线条,点或梯度方向)。它在骨干网和分段模型之间的功能之间保密,有助于防止这些低级功能的丢失。最后,引入“深度不受欢迎的卷积(DA)”以通过使用四个多分辨率层与扩张的卷积策略合作优化提取的特征。该实验是在三种数据集中进行的:来自Landsat-8卫星的两种私人集团,并从“ISPRS Vaihingen”挑战中的一个公共基准。有两个基线模型:深度编码器 - 解码器网络(DCED)和我们之前的模型。结果表明,所提出的模型显着优于所有基线。它是所有数据集中的赢家,分别超过了两个LANDSAT-8和ISPRS Vaihingen数据集的F 1:0.9114,0.9362和0.9111的90%以上。此外,它几乎可以在几乎所有课程上实现超过90%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号