首页> 外文期刊>Quality Control, Transactions >Depth-Wise Asymmetric Bottleneck With Point-Wise Aggregation Decoder for Real-Time Semantic Segmentation in Urban Scenes
【24h】

Depth-Wise Asymmetric Bottleneck With Point-Wise Aggregation Decoder for Real-Time Semantic Segmentation in Urban Scenes

机译:深度明智的不对称瓶颈,具有点明智的聚合解码器,用于城市场景中的实时语义细分

获取原文
获取原文并翻译 | 示例
       

摘要

Semantic segmentation is a process of linking each pixel in an image to a class label, and is widely used in the field of autonomous vehicles and robotics. Although deep learning methods have already made great progress for semantic segmentation, they either achieve great results with numerous parameters or design lightweight models but heavily sacrifice the segmentation accuracy. Because of the strict requirements of real-world applications, it is critical to design an effective real-time model with both competitive segmentation accuracy and small model capacity. In this paper, we propose a lightweight network named DABNet, which employs Depth-wise Asymmetric Bottleneck (DAB) and Point-wise Aggregation Decoder (PAD) module to tackle the challenging real-time semantic segmentation in urban scenes. Specifically, the DAB module creates a sufficient receptive field and densely utilizes the contextual information, and the PAD module aggregates the feature maps of different scales to optimize performance through the attention mechanism. Compared with existing methods, our network substantially reduces the number of parameters but still achieves high accuracy with real-time inference ability. Extensive ablation experiments on two challenging urban scene datasets (Cityscapes and CamVid) have proved the effectiveness of the proposed approach in real-time semantic segmentation.
机译:语义分割是将图像中的每个像素链接到类标签的过程,并且广泛用于自主车辆和机器人领域。虽然深度学习方法已经为语义细分进行了巨大的进展,但它们要么通过许多参数或设计轻量级模型实现了很大的结果,但是很大地牺牲了分割准确性。由于对现实世界应用的严格要求,设计具有竞争性分割精度和小型型号的有效实时模型至关重要。在本文中,我们提出了一个名为Dabnet的轻量级网络,它采用深度明智的不对称瓶颈(DAB)和点明智的聚合解码器(PAD)模块来解决城市场景中的具有挑战性的实时语义分段。具体地,DAB模块创建了足够的接收字段,并且密集地利用上下文信息,焊盘模块聚合不同尺度的特征映射以通过注意机制优化性能。与现有方法相比,我们的网络基本上减少了参数的数量,但仍然实现了具有实时推理能力的高精度。在两个具有挑战性的城市场景数据集(城市景观和Camvid)上进行了广泛的消融实验已经证明了所提出的方法在实时语义细分中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号