首页> 外文会议>IEEE International Conference on Multimedia and Expo >Crowd Counting via Multi-view Scale Aggregation Networks
【24h】

Crowd Counting via Multi-view Scale Aggregation Networks

机译:通过多视图秤聚合网络计数人群计数

获取原文

摘要

Crowd counting, aiming at estimating the total number of people in unconstrained crowded scenes, has increasingly received attention. But it is greatly challenged by the huge variation in people scale. In this paper, we propose a novel Multi-View Scale Aggregation Network (MVSAN), which handle the scale variation from feature, input and criterion view comprehensively. Firstly, we design a simple but effective Multi-Scale Feature Encoder, which exploits dilated convolution layers with various dilation rates to improve the representation ability and scale diversity of features. Secondly, we feed multiple scales of input images into networks to generate high-quality density maps in a coarse-to-fine manner. Finally, we propose a Multi-Scale Structural Similarity loss to force our networks to learn the local correlation of density maps. Extensive experiments on two standard benchmarks show that the proposed method can generate high-quality crowd density map and accurate count estimation, outperforming the state-of-the-art methods with a large margin.
机译:人群计数,旨在估计无关挤在一起拥挤的场景中的人数,越来越受到关注。但是,人们规模的巨大变异是大大挑战。在本文中,我们提出了一种新的多视图级聚合网络(MVSAN),其综合地处理特征,输入和标准视图的比例变化。首先,我们设计一个简单但有效的多尺度特征编码器,它利用具有各种扩张速率的扩张卷积层,以提高特征的表示能力和规模分集。其次,我们将多个输入图像的多个尺度馈送到网络中以产生以粗略的方式产生高质量密度图。最后,我们提出了一种多规模的结构相似性损失,以强制我们的网络来学习密度图的本地相关性。两个标准基准的广泛实验表明,该方法可以产生高质量的人群密度图和准确的计数估计,优于具有大边距的最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号