Crowd Counting via Multi-view Scale Aggregation Networks

机译：通过多视图秤聚合网络计数人群计数

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Crowd counting, aiming at estimating the total number of people in unconstrained crowded scenes, has increasingly received attention. But it is greatly challenged by the huge variation in people scale. In this paper, we propose a novel Multi-View Scale Aggregation Network (MVSAN), which handle the scale variation from feature, input and criterion view comprehensively. Firstly, we design a simple but effective Multi-Scale Feature Encoder, which exploits dilated convolution layers with various dilation rates to improve the representation ability and scale diversity of features. Secondly, we feed multiple scales of input images into networks to generate high-quality density maps in a coarse-to-fine manner. Finally, we propose a Multi-Scale Structural Similarity loss to force our networks to learn the local correlation of density maps. Extensive experiments on two standard benchmarks show that the proposed method can generate high-quality crowd density map and accurate count estimation, outperforming the state-of-the-art methods with a large margin.

机译：人群计数，旨在估计无关挤在一起拥挤的场景中的人数，越来越受到关注。但是，人们规模的巨大变异是大大挑战。在本文中，我们提出了一种新的多视图级聚合网络（MVSAN），其综合地处理特征，输入和标准视图的比例变化。首先，我们设计一个简单但有效的多尺度特征编码器，它利用具有各种扩张速率的扩张卷积层，以提高特征的表示能力和规模分集。其次，我们将多个输入图像的多个尺度馈送到网络中以产生以粗略的方式产生高质量密度图。最后，我们提出了一种多规模的结构相似性损失，以强制我们的网络来学习密度图的本地相关性。两个标准基准的广泛实验表明，该方法可以产生高质量的人群密度图和准确的计数估计，优于具有大边距的最先进的方法。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2019年|621p|共6页
会议地点
作者
Zhilin Qiu; Lingbo Liu; Guanbin Li; Qing Wang; Nong Xiao; Liang Lin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类多媒体技术与多媒体计算机;
关键词
Feature extraction; Kernel; Feeds; Correlation; Convolution; Network architecture; Neural networks;

机译：特征提取;核;饲料;相关;卷积;网络架构;神经网络;

相似文献

外文文献
中文文献
专利

1. Crowd counting via scale-communicative aggregation networks [J] . Yuan Lixian, Qiu Zhilin, Liu Lingbo, Neurocomputing . 2020,第Octa7期

机译：通过尺度传播聚合网络计数人群
2. Crowd counting via learning perspective for multi-scale multi-view Web images [J] . Shang Chong, Ai Haizhou, Yang Yi Frontiers of computer science in China . 2019,第3期

机译：通过学习角度进行人群计数，以实现多尺度多视图Web图像
3. Crowd counting via learning perspective for multi-scale multi-view Web images [J] . Shang Chong, Ai Haizhou, Yang Yi Frontiers of computer science . 2019,第3期

机译：人群通过学习的多尺度多视图网页映射计数
4. Crowd Counting via Multi-view Scale Aggregation Networks [C] . Zhilin Qiu, Lingbo Liu, Guanbin Li, IEEE International Conference on Multimedia and Expo . 2019

机译：通过多视图规模聚合网络进行人群计数
5. Automated Crowd-Counting System upon a Distributed Camera Network. [D] . Morrow, Mulloy. 2012

机译：分布式摄像机网络上的自动人群计数系统。
6. Hierarchical multi-view aggregation network for sensor-based human activity recognition [O] . Xiheng Zhang, Yongkang Wong, Mohan S. Kankanhalli, 2012

机译：分层的多视图聚合网络，用于基于传感器的人类活动识别
7. CASA-Crowd: A Context-Aware Scale Aggregation CNN-Based Crowd Counting Technique [O] . Naveed Ilyas, Ashfaq Ahmad, Kiseon Kim 2019

机译：Casa-Crowd：一种基于背景的语境知识分比CNN的人群计数技术
8. Recursive Aggregation-Disaggregation Method to Approximate Large-Scale Closed Queueing Networks with Multiple Job Types [R] . Vandoremalen, J., Wessels, J. 1988

机译：具有多种作业类型的大规模封闭排队网络的递归聚合 - 分解方法

Crowd Counting via Multi-view Scale Aggregation Networks

摘要

著录项

相似文献

相关主题

期刊订阅