首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >Crowd Counting via Scale-Adaptive Convolutional Neural Network
【24h】

Crowd Counting via Scale-Adaptive Convolutional Neural Network

机译:通过比例自适应卷积神经网络进行人群计数

获取原文

摘要

The task of crowd counting is to automatically estimate the pedestrian number in crowd images. To cope with the scale and perspective changes that commonly exist in crowd images, state-of-the-art approaches employ multi-column CNN architectures to regress density maps of crowd images. Multiple columns have different receptive fields corresponding to pedestrians (heads) of different scales. We instead propose a scale-adaptive CNN (SaCNN) architecture with a backbone of fixed small receptive fields. We extract feature maps from multiple layers and adapt them to have the same output size; we combine them to produce the final density map. The number of people is computed by integrating the density map. We also introduce a relative count loss along with the density map loss to improve the network generalization on crowd scenes with few pedestrians, where most representative approaches perform poorly on. We conduct extensive experiments on the ShanghaiTech, UCF_CC_50 and WorldExpo'10 datasets as well as a new dataset SmartCity that we collect for crowd scenes with few people. The results demonstrate significant improvements of SaCNN over the state-of-the-art.
机译:人群计数的任务是自动估计人群图像中的行人人数。为了应对人群图像中普遍存在的比例尺和视角变化,最新技术采用多列CNN架构来回归人群图像的密度图。多列具有对应于不同规模的行人(头部)的不同感受野。相反,我们提出了一种具有规模的自适应CNN(SaCNN)体系结构,该体系结构具有固定的小接受域的主干。我们从多个图层中提取要素图,并对其进行调整以使其具有相同的输出大小;我们将它们结合起来以生成最终的密度图。通过积分密度图来计算人数。我们还引入了相对计数损失和密度图损失,以改善在行人少的人群场景中网络的泛化性,在这些场景中,大多数代表性方法的效果较差。我们对ShanghaiTech,UCF_CC_50和WorldExpo'10数据集以及为少数人的人群场景收集的新数据集SmartCity进行了广泛的实验。结果表明,与最新技术相比,SaCNN有了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号