A SINGLE-STAGE PEDESTRIAN DETECTOR BASED ON SSD WITH MULTI-SCALE FEATURE EXTRACTION AND RESIDUAL BLOCK

HOANH NGUYEN

摘要

Pedestrian detection is a key problem in many intelligent transport systems. In driving environment, apart from the detection accuracy, the inference speed is also a large concern. Although popular two-stage object detectors such as Faster R-CNN have achieved significant improvements in pedestrian detection accuracy, it is still slow for real-time pedestrian detection in driving environment. On the other hand, popular one-stage object detectors such as SSD have not achieved competitive detection accuracy on pedestrian detection benchmarks. This paper proposes a one-stage detector for real-time pedestrian detection in driving environment. The proposed approach is based on popular SSD framework. To improve the detection accuracy, the backbone network in original SSD framework is replaced by the backbone sub-network based on DenseNets structure, which includes stem module, dense blocks, and transition layers. With dense connection in DenseNet architecture, the proposed approach can achieve higher accuracy with fewer parameters compared with ResNet architecture. In the detection sub-network, enhanced feature extraction subnet takes convolution layers generated by the backbone sub-network to generate enhanced feature maps by fusing operation, atrous convolution and deconvolution operation. Enhanced feature maps can enhance the detection performance of multi-scale pedestrian detection. In addition, residual blocks are added before each prediction layer to reduce the computational cost and improve the detection accuracy. Experimental results on Caltech and CityPersons dataset show that the proposed approach achieves better accuracy compared with popular two-stage detectors while being faster.

机译：行人检测是许多智能传输系统中的关键问题。在驾驶环境中，除了检测精度外，推理速度也是一个很大的关注。虽然流行的双阶段对象探测器，如R-CNN的更快，但在行人检测精度方面取得了显着的改进，但对于驾驶环境中的实时行人检测仍然慢速。另一方面，SSD等流行的一级对象探测器在行人检测基准上没有实现竞争检测准确性。本文提出了一阶段检测器，用于驾驶环境中的实时行人检测。所提出的方法是基于流行的SSD框架。为了提高检测精度，原始SSD框架中的骨干网络由基于Densenets结构的骨干子网络代替，包括阀杆模块，密集块和转换层。凭借DenSenet架构中的密集连接，与Reset架构相比，所提出的方法可以通过更少的参数实现更高的准确性。在检测子网中，增强特征提取子网采用骨干子网生成的卷积层，通过熔化操作，不足的卷积和解卷积操作来生成增强功能映射。增强的特征图可以增强多尺度行人检测的检测性能。另外，在每个预测层之前添加残余块以降低计算成本并提高检测精度。 CALTECH和CITYPERSONS DATASET上的实验结果表明，与流行的两级探测器相比，该方法的准确性更好，同时更快。

A SINGLE-STAGE PEDESTRIAN DETECTOR BASED ON SSD WITH MULTI-SCALE FEATURE EXTRACTION AND RESIDUAL BLOCK

摘要

著录项

相关主题

期刊订阅