首页> 外文期刊>Expert systems with applications >Optimized HRNet for image semantic segmentation
【24h】

Optimized HRNet for image semantic segmentation

机译:用于图像语义分割的优化HRNET

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

With the rapid development of deep learning, image semantic segmentation has made great progress and become a hot topic in scene understanding of computer vision. In this paper, we propose an optimized high-resolution net (HRNet) for image semantic segmentation. Unlike traditional networks usually extract feature maps based on a high-to-low encoder, which may easily loss important shape and boundary details especially for the deeper layers with lower resolutions, our optimized HRNet can maintain high resolution features at all times using a relatively shallow and parallel network structure. To improve the ability of our model in better recognizing the objects with various scales and irregular shapes, we introduce a mixed dilated convolution (MDC) module, which can not only increase the diversity of the receptive fields, but also tackle the "gridding" problem commonly existing in the conventional dilated convolution. By minimizing fine detail lost based on a DUpsample strategy, we further develop a multi-level data-dependent feature aggregation (MDFA) module to enhance the capability of our network in better identifying the fine details especially for the small objects with fuzzy boundaries. We evaluate the optimized HRNet on four different datasets, including Cityscapes, Pascal VOC2012, CamVid and the KITTI. Experimental results validate the effectiveness of our method in improving the accuracy of image semantic segmentation. Comparisons with state-of-the-art methods also verify the advantages of our optimized HRNet in achieving better semantic segmentation performance.
机译:随着深度学习的快速发展,图像语义细分已经取得了很大的进步,成为对计算机愿景的现场了解的热门话题。在本文中,我们提出了一种用于图像语义分割的优化的高分辨率网(HRNET)。与传统网络不同,基于高到低编码器提取特征图,这可能很容易损失重要的形状和边界细节,特别是对于具有较低分辨率的更深层,我们的优化HRNET可以使用相对较浅的时间保持高分辨率特征和并行网络结构。为了提高我们的模型的能力,在更好地识别具有各种尺度和不规则形状的物体中,我们引入了混合扩张的卷积(MDC)模块,这不仅可以增加接收领域的多样性,而且还可以解决“网格”问题通常存在于传统的扩张卷积中。通过基于Dupsample策略丢失的精细细节,我们进一步开发了多级数据依赖性特征聚合(MDFA)模块,以提高我们网络的能力,更好地识别尤其是对于具有模糊边界的小对象的细节。我们在四个不同的数据集中评估优化的HRNET,包括城市景观,Pascal VOC2012,Camvid和Kitti。实验结果验证了我们提高图像语义分割精度的方法的有效性。最先进的方法的比较还验证了我们优化的HRNET的优势,实现了更好的语义分割性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号