首页> 外文期刊>Multimedia Tools and Applications >Multi-scale dilated convolution of convolutional neural network for crowd counting
【24h】

Multi-scale dilated convolution of convolutional neural network for crowd counting

机译:卷积神经网络的多尺度扩张卷积用于人群计数

获取原文
获取原文并翻译 | 示例
           

摘要

Growing numbers of crowd density estimation methods have been developed in scene monitoring, crowd safety and on-site management scheduling. We proposed a method for density estimation of a single static image based on convolutional neural network naming Multi-scale Dilated Convolution of Convolutional Neural Network (Multi-scale-CNN). The proposed method employed the method of density maps regression to learn the mapping relationship between single-image and density maps through convolutional neural network. The adopted network structure is composed of two major components to adapt changes of characters scales in crowd images, a convolutional neural network for the general feature extraction and the other is multi-scale dilated convolution for disposing the scale change problem. It is insufficient for currently study that tackled the multi-column or multi-input convolutional neural networks to solve multi-scale problems. Our method utilizes a single-column network to extract features and combines multi-scale dilated convolution to aggregate multi-scale information to address the shortcomings of two networks. The multi-scale dilated convolution module aggregates multi-scale context information systematically by making use of dilated convolution without reducing the receiving domain, thereby integrate the underlying detail information into the high-level semantic features to promote the perception and counting ability of network for small targets. This paper demonstrates the proposed network structure in ShanghaiTech dataset, UCF_CC_50 dataset and worldexpo'10 dataset, and compares the results with numbers of current mainstream crowd counting algorithms, proves that our method surpasses current state-of-the-art methods and has excellent counting accuracy and robustness. The training and testing codes of our method models can be downloaded at https://github.com/doctorwgd/Multi-scale-CNN.
机译:在场景监视,人群安全和现场管理调度中,已经开发了越来越多的人群密度估计方法。我们提出了一种基于卷积神经网络的卷积神经网络命名方法,用于对单个静态图像进行密度估计。该方法采用了密度图回归的方法,通过卷积神经网络学习了单张图像与密度图之间的映射关系。所采用的网络结构由两个主要部分组成,以适应人群图像中字符尺度的变化,一个用于一般特征提取的卷积神经网络,另一个是用于解决尺度变化问题的多尺度膨胀卷积。对于目前研究多列或多输入卷积神经网络以解决多尺度问题而言,这是不够的。我们的方法利用单列网络提取特征,并结合多尺度膨胀卷积以聚合多尺度信息,以解决两个网络的缺点。多尺度扩张卷积模块通过利用扩张卷积来系统地聚合多尺度上下文信息,而不会减少接收域,从而将底层细节信息集成到高级语义特征中,以提高网络对小规模用户的感知和计数能力。目标。本文在ShanghaiTech数据集,UCF_CC_50数据集和worldexpo'10数据集中演示了拟议的网络结构,并将结果与​​当前主流人群计数算法的数量进行了比较,证明了我们的方法超越了当前的最新方法并且具有出色的计数能力准确性和鲁棒性。我们方法模型的训练和测试代码可以从https://github.com/doctorwgd/Multi-scale-CNN下载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号