...
首页> 外文期刊>World Wide Web >Multi-scale deep context convolutional neural networks for semantic segmentation
【24h】

Multi-scale deep context convolutional neural networks for semantic segmentation

机译:用于语义分割的多尺度深度上下文卷积神经网络

获取原文
获取原文并翻译 | 示例
           

摘要

Recent years have witnessed the great progress for semantic segmentation using deep convolutional neural networks (DCNNs). This paper presents a novel fully convolutional network for semantic segmentation using multi-scale contextual convolutional features. Since objects in natural images tend to be with various scales and aspect ratios, capturing the rich contextual information is very critical for dense pixel prediction. On the other hand, when going deeper in convolutional layers, the convolutional feature maps of traditional DCNNs gradually become coarser, which may be harmful for semantic segmentation. According to these observations, we attempt to design a multi-scale deep context convolutional network (MDCCNet), which combines the feature maps from different levels of network in a holistic manner for semantic segmentation. The segmentation outputs of MDCCNets are further enhanced using dense connected conditional random fields (CRF). The proposed network allows us to fully exploit local and global contextual information, ranging from an entire scene to every single pixel, to perform pixel-wise label estimation. The experimental results demonstrate that our method outperforms or is comparable to state-of-the-art methods on PASCAL VOC 2012 and SIFTFlow semantic segmentation datasets.
机译:近年来,目睹了使用深度卷积神经网络(DCNN)进行语义分割的巨大进步。本文提出了一种新颖的完全卷积网络,用于使用多尺度上下文卷积特征进行语义分割。由于自然图像中的对象往往具有各种比例和纵横比,因此捕获丰富的上下文信息对于密集像素预测非常关键。另一方面,当深入卷积层时,传统DCNN的卷积特征图会逐渐变粗,这可能不利于语义分割。根据这些观察,我们尝试设计一个多尺度的深度上下文卷积网络(MDCCNet),该网络以不同的方式组合了来自网络不同级别的特征图,以进行语义分割。 MDCCNet的分段输出使用密集连接的条件随机字段(CRF)进一步增强。拟议的网络使我们能够充分利用从整个场景到每个单个像素的本地和全局上下文信息,以执行像素级标签估计。实验结果表明,在PASCAL VOC 2012和SIFTFlow语义分割数据集上,我们的方法优于或可与最新方法媲美。

著录项

  • 来源
    《World Wide Web》 |2019年第2期|555-570|共16页
  • 作者单位

    Nanjing Univ Posts Telecommun, Natl Engn Res Ctr Commun & Networking, Nanjing, Jiangsu, Peoples R China|Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Fujian, Peoples R China;

    Nanjing Univ Posts Telecommun, Natl Engn Res Ctr Commun & Networking, Nanjing, Jiangsu, Peoples R China|Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Fujian, Peoples R China;

    Nanjing Univ Sci Technol, Minist Educ, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Jiangsu, Peoples R China;

    Guizhou Normal Univ, Sch Big Data & Comp Sci, Guiyang, Guizhou, Peoples R China;

    Kyushu Inst Technol, Dept Mech & Control Engn, Kitakyushu, Fukuoka, Japan;

    Huawei Technol Co Ltd, Shenzhen, Peoples R China;

    Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multi-scale context; MDCNNs; Semantic segmentation; CRF;

    机译:多尺度上下文;MDCNN;语义分割;CRF;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号