首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >A Real-Time Image Semantic Segmentation Method Based on Multilabel Classification
【24h】

A Real-Time Image Semantic Segmentation Method Based on Multilabel Classification

机译:基于Multilabel分类的实时映像语义分割方法

获取原文
           

摘要

Image semantic segmentation as a kind of technology has been playing a crucial part in intelligent driving, medical image analysis, video surveillance, and AR. However, since the scene needs to infer more semantics from video and audio clips and the request for real-time performance becomes stricter, whetherthe single-label classification method that was usually used before or the regular manual labeling cannot meet this end. Given the excellent performance of deep learning algorithms in extensive applications, the image semantic segmentation algorithm based on deep learning framework has been brought under the spotlight of development. This paper attempts to improve the ESPNet (Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation) based on the multilabel classification method by the following steps. First, the standard convolution is replaced by applying Receptive Field in Deep Convolutional Neural Network in the convolution layer, to the extent that every pixel in the covered area would facilitate the ultimate feature response. Second, the ASPP (Atrous Spatial Pyramid Pooling) module is improved based on the atrous convolution, and the DB-ASPP (Delate Batch Normalization-ASPP) is proposed as a way to reducing gridding artifacts due to the multilayer atrous convolution, acquiring multiscale information, and integrating the feature information in relation to the image set. Finally, the proposed model and regular models are subject to extensive tests and comparisons on a plurality of multiple data sets. Results show that the proposed model demonstrates a good accuracy of segmentation, the smallest network parameter at 0.3?M and the fastest speed of segmentation at 25 FPS.
机译:图像语义分割作为一种技术一直扮演着智能驾驶,医学图像分析,视频监控和AR的重要组成部分。然而,由于现场需要推断视频和音频剪辑和实时性能的要求变得更严格的语义,whetherthe单标签分类方法之前,通常使用或常规的手动贴标不能满足这一目的。鉴于深学习算法中广泛应用的优良性能的基础上,深度学习框架,图像语义分割算法已被开发的聚光灯之下。基于由以下步骤多标记分类方法本文试图改善ESPNet(高效空间扩张型循环卷积的语义分割金字塔)。首先,标准的卷积是通过在卷积层施加感受野在深卷积神经网络,在某种程度上,在覆盖区域中的每个像素将有利于最终的特征响应代替。其次,ASPP(Atrous空间金字塔池)模块是基于atrous卷积提高,并且DB-ASPP(啦免费统计批标准化-ASPP)被提出作为一种方法来减少网格伪影归因于多层atrous卷积,获得多尺度信息和集成相对于图像集的特征信息。最后,该模型与普通机型都受到在多个多个数据集广泛的测试和比较。结果表明,该模型表明分割的精度好,在0.3?m最小的网络参数和分割在25 FPS最快的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号