...
首页> 外文期刊>ISPRS Journal of Photogrammetry and Remote Sensing >Learning deep cross-scale feature propagation for indoor semantic segmentation
【24h】

Learning deep cross-scale feature propagation for indoor semantic segmentation

机译:学习室内语义分割的深度串尺度特征传播

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.
机译:室内语义细分是一项长期愿景任务,最近被卷积神经网络(CNNS)推出,但是该任务仍然受到室内场景的高闭塞和大规模变化的挑战。现有的基于CNN的方法主要专注于使用辅助深度数据来丰富从RGB图像中提取的功能,因此,它们不注意利用剥削特征中的多尺度信息,这对于区分高度杂乱的室内场景是必不可少的。本文提出了深度跨尺度特征传播网络(CSNet),以有效学习和保险熔断多尺度特征,用于室内场景图像的强大语义分割。所提出的CSNet部署为编码器解码器引擎。在编码期间,CSNet在跨尺度上传播上下文信息并学习判别的多尺度特征,这对大型对象变化和室内遮挡具有鲁棒性。然后,CSNET的解码器随心全相地将多尺度编码特征与所有尺度的融合监控进行了集成,以生成目标语义分段预测。在两个具有挑战性的基准上进行的广泛实验表明,CSNET分别可以有效地学习强大的室内语义分割的多种表现,分别在NYUDV2和SUN-RGBD数据集上实现了51.5和50.8的卓越性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号