IEEE Transactions on Geoscience and Remote Sensing

Dense Semantic Labeling of Subdecimeter Resolution Images With Convolutional Neural Networks


Abstract

Semantic labeling (or pixel-level land-cover classification) in ultrahigh-resolution imagery (<10 cm) requires statistical models able to learn high-level concepts from spatial data, with large appearance variations. Convolutional neural networks (CNNs) achieve this goal by learning discriminatively a hierarchy of representations of increasing abstraction. In this paper, we present a CNN-based system relying on a downsample-then-upsample architecture. Specifically, it first learns a rough spatial map of high-level representations by means of convolutions and then learns to upsample them back to the original resolution by deconvolutions. By doing so, the CNN learns to densely label every pixel at the original resolution of the image. This results in many advantages, including: 1) state-of-the-art numerical accuracy; 2) improved geometric accuracy of predictions; and 3) high efficiency at inference time. We test the proposed system on the Vaihingen and Potsdam subdecimeter resolution data sets, involving the semantic labeling of aerial images of 9- and 5-cm resolution, respectively. These data sets are composed of many large and fully annotated tiles, allowing an unbiased evaluation of models making use of spatial information. We do so by comparing two standard CNN architectures with the proposed one: standard patch classification, prediction of local label patches by employing only convolutions, and full patch labeling by employing deconvolutions. All the systems compare favorably with or outperform a state-of-the-art baseline relying on superpixels and powerful appearance descriptors. The proposed full patch labeling CNN outperforms these models by a large margin, also showing a very appealing inference time.
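The sketch below illustrates the downsample-then-upsample idea described in the abstract: strided convolutions produce a coarse map of high-level features, and transposed convolutions ("deconvolutions") upsample it back to the input resolution so every pixel receives a class score. It is a minimal PyTorch illustration only; the layer counts, channel widths, and the six-class output are placeholder assumptions, not the exact architecture reported in the paper.

```python
# Minimal downsample-then-upsample network for dense labeling (illustrative sketch,
# not the architecture from the paper; sizes and class count are placeholders).
import torch
import torch.nn as nn

class DenseLabelingCNN(nn.Module):
    def __init__(self, in_channels=3, num_classes=6):
        super().__init__()
        # Encoder: strided convolutions learn a coarse spatial map of high-level features.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
        )
        # Decoder: transposed convolutions learn to upsample the coarse map back
        # to the original resolution, producing one score per class per pixel.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, kernel_size=4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, 32, kernel_size=4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, num_classes, kernel_size=4, stride=2, padding=1),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# Usage: an image patch in, a full-resolution map of class scores out.
model = DenseLabelingCNN()
patch = torch.randn(1, 3, 256, 256)   # e.g. a 256x256 RGB aerial patch
scores = model(patch)                 # shape: (1, 6, 256, 256)
labels = scores.argmax(dim=1)         # per-pixel class predictions
```

Because the decoder restores the original resolution in a single forward pass, the whole patch is labeled at once, which is what gives this family of models its favorable inference time compared with per-pixel patch classification.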

