International Symposium on Experimental Robotics

Deep Multispectral Semantic Scene Understanding of Forested Environments Using Multimodal Fusion



Abstract

Semantic scene understanding of unstructured environments is a highly challenging task for robots operating in the real world. Deep Convolutional Neural Network architectures define the state of the art in various segmentation tasks. So far, researchers have focused on segmentation with RGB data. In this paper, we study the use of multispectral and multimodal images for semantic segmentation and develop fusion architectures that learn from RGB, Near-InfraRed channels, and depth data. We introduce a first-of-its-kind multispectral segmentation benchmark that contains 15,000 images and 366 pixel-wise ground truth annotations of unstructured forest environments. We identify new data augmentation strategies that enable training of very deep models using relatively small datasets. We show that our UpNet architecture exceeds the state of the art both qualitatively and quantitatively on our benchmark. In addition, we present experimental results for segmentation under challenging real-world conditions. Benchmark and demo are publicly available at http://deepscene.cs.uni-freiburg.de.
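The abstract describes fusion architectures that combine RGB, Near-InfraRed, and depth channels for per-pixel classification. As a hedged illustration of the underlying late-fusion idea (not the paper's UpNet architecture, which fuses learned CNN feature maps), the sketch below averages the per-pixel class probabilities produced by two hypothetical modality-specific models and takes the argmax; all names and the toy data are assumptions for illustration only.

```python
# Minimal sketch of late fusion for semantic segmentation, assuming two
# modality-specific models have already produced per-pixel class
# probabilities. The real UpNet fuses deep feature maps inside the
# network; here we simply average the output distributions.

def fuse_predictions(scores_rgb, scores_nir, w_rgb=0.5):
    """scores_rgb / scores_nir: HxW grids of per-class probability lists.
    Returns an HxW grid of fused class labels, i.e. the argmax of the
    weighted average of the two per-pixel distributions."""
    fused = []
    for row_rgb, row_nir in zip(scores_rgb, scores_nir):
        fused_row = []
        for p_rgb, p_nir in zip(row_rgb, row_nir):
            avg = [w_rgb * a + (1.0 - w_rgb) * b
                   for a, b in zip(p_rgb, p_nir)]
            fused_row.append(max(range(len(avg)), key=avg.__getitem__))
        fused.append(fused_row)
    return fused

# Toy 1x2 "image" with 3 hypothetical classes (sky, trail, vegetation):
rgb = [[[0.6, 0.3, 0.1], [0.2, 0.5, 0.3]]]
nir = [[[0.2, 0.2, 0.6], [0.1, 0.8, 0.1]]]
print(fuse_predictions(rgb, nir))  # [[0, 1]]
```

With equal weights, the first pixel stays class 0 (the RGB branch is more confident) while both branches agree on class 1 for the second pixel; shifting `w_rgb` trades off how much each modality influences the fused label.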

