首页> 外文会议>Conference on Pattern Recognition and Tracking;Society of Photo-Optical Instrumentation Engineers >A Crowd Counting Method Based on Multi-column Dilated Convolutional Neural Network
【24h】

A Crowd Counting Method Based on Multi-column Dilated Convolutional Neural Network

机译:基于多列膨胀卷积神经网络的人群计数方法

获取原文

摘要

Crowd counting is an important part of crowd analysis, which is of great significance to crowd control and management.The convolutional neural network (CNN) based crowd counting method is widely used to solve the problem ofinsufficient counting accuracy due to heavy occlusion, background clutters, head scale and perspective changes in crowdscenes. The multi-column convolutional neural network (MCNN) is a CNN-based method for crowd counting, whichadapts to head scale variation of crowd scenes by constructing multi-column convolutional neural network composing ofthree single-column networks corresponding to the convolution kernel with different sizes (large, medium and small).However, as the MCNN network is relatively shallow, its receptive field is also limited, which affects the adaptability tolarge scale variations. In addition, due to insufficient training data, it is necessary to carry out a pre-training strategieswhich pre-trains the single-column convolutional neural network individually and combines the cumbersome. In thispaper, a crowd counting method based on multi-column dilated convolutional neural network was proposed. Dilatedconvolution was used to enhance the receptive field of the network, so as to be better adaptive to the head scalevariations. The image patches were obtained by randomly clipping from the original training data set images in theprocess of each iterative training to further expand the training data, while the training could be achieved without tediouspre-training. The experimental results on ShanghaiTech public dataset showed that the accuracy of crowd countingproposed in this paper was better than that of MCNN, which proved that this method is more robust to head scalevariations in crowd scenes.
机译:人群计数是人群分析的重要组成部分,对人群的控制和管理具有重要意义。 基于卷积神经网络(CNN)的人群计数方法被广泛用于解决 由于严重的咬合,背景杂乱,头部比例和人群的视角变化,导致计数准确性不足 场景。多列卷积神经网络(MCNN)是基于CNN的人群计数方法,该方法 通过构建多列卷积神经网络来适应人群场景的头部尺度变化 与大小不同(大,中,小)的卷积内核相对应的三个单列网络。 但是,由于MCNN网络相对较浅,其接受域也受到限制,这影响了对神经网络的适应性。 大规模的变化。另外,由于培训数据不足,有必要进行预培训策略 它会分别预先训练单列卷积神经网络并合并繁琐的操作。在这个 提出了一种基于多列膨胀卷积神经网络的人群计数方法。膨胀的 卷积被用来增强网络的接收场,以便更好地适应头部尺度 变化。图像补丁是通过随机裁剪原始训练数据集图像中的图像而获得的。 每次迭代训练的过程,以进一步扩展训练数据,而无需繁琐的训练即可完成训练 预训练。 ShanghaiTech公开数据集上的实验结果表明,人群计数的准确性 本文提出的方法要比MCNN的方法要好,这证明了该方法对头部比例尺的鲁棒性 人群场景的变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号