Learning Multi-Level Density Maps for Crowd Counting

Jiang Xiaoheng; Zhang Li; Lv Pei; Guo Yibo; Zhu Ruijie; Li Yafei; Pang Yanwei; Li Xi; Zhou Bing; Xu Mingliang

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Learning Multi-Level Density Maps for Crowd Counting

【24h】

Learning Multi-Level Density Maps for Crowd Counting

机译：学习人群计数的多级密度图

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

People in crowd scenes often exhibit the characteristic of imbalanced distribution. On the one hand, people size varies largely due to the camera perspective. People far away from the camera look smaller and are likely to occlude each other, whereas people near to the camera look larger and are relatively sparse. On the other hand, the number of people also varies greatly in the same or different scenes. This article aims to develop a novel model that can accurately estimate the crowd count from a given scene with imbalanced people distribution. To this end, we have proposed an effective multi-level convolutional neural network (MLCNN) architecture that first adaptively learns multi-level density maps and then fuses them to predict the final output. Density map of each level focuses on dealing with people of certain sizes. As a result, the fusion of multi-level density maps is able to tackle the large variation in people size. In addition, we introduce a new loss function named balanced loss (BL) to impose relatively BL feedback during training, which helps further improve the performance of the proposed network. Furthermore, we introduce a new data set including 1111 images with a total of 49 061 head annotations. MLCNN is easy to train with only one end-to-end training stage. Experimental results demonstrate that our MLCNN achieves state-of-the-art performance. In particular, our MLCNN reaches a mean absolute error (MAE) of 242.4 on the UCF_CC_50 data set, which is 37.2 lower than the second-best result.

机译：人群场景中的人经常表现出分布不平衡的特征。一方面，由于相机的角度，人们的大小在很大程度上变化。远离相机的人看起来更小，很可能会互相遮挡，而靠近相机的人看起来更大，并且相对稀疏。另一方面，人数也在相同或不同的场景中变化。本文旨在开发一种新型模型，可以准确地估计来自特定场景的人群计数，人们分发不平衡。为此，我们提出了一种有效的多级卷积神经网络（MLCNN）架构，首先自适应地学习多级密度映射，然后使其熔化以预测最终输出。每个级别的密度图侧重于处理某些尺寸的人。结果，多级密度图的融合能够解决人们大小的大变化。此外，我们介绍了一个名为均衡丢失（BL）的新损失函数，以在训练期间强加相对的BL反馈，这有助于进一步提高所提出的网络的性能。此外，我们介绍了一个新的数据集，包括1111个图像，总共49个061个头注释。 MLCNN易于培训，只能用一个端到端的训练阶段训练。实验结果表明，我们的MLCNN实现了最先进的性能。特别是，我们的MLCNN在UCF_CC_50数据集上达到242.4的平均绝对误差（MAE），这是37.2低于第二个最佳结果。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2020年第8期|2705-2715|共11页
作者
Jiang Xiaoheng; Zhang Li; Lv Pei; Guo Yibo; Zhu Ruijie; Li Yafei; Pang Yanwei; Li Xi; Zhou Bing; Xu Mingliang;
展开▼
作者单位

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China;

Zhejiang Univ Sch Comp Sci & Technol Hangzhou 310058 Zhejiang Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

Zhengzhou Univ Sch Informat Engn Zhengzhou 450001 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Cameras; Head; Estimation; Feature extraction; Switches; Training; Adaptation models; Balanced loss (BL); convolutional neural network (CNN); crowd counting; multi-level density maps;

机译：相机;头;估计;特征提取;开关;培训;适应模型;平衡损失（BL）;卷积神经网络（CNN）;人群计数;多级密度图;

相似文献

外文文献
中文文献
专利

1. Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks—Counting, Detection, and Tracking [J] . Kang Di, Ma Zheng, Chan Antoni B. IEEE Transactions on Circuits and Systems for Video Technology . 2019,第5期

机译：超越计数：人群分析任务的密度图比较-计数，检测和跟踪
2. Crowd counting by using multi-level density-based spatial information: A Multi-scale CNN framework [J] . Information Sciences: An International Journal . 2020,第期

机译：人群计数通过使用基于多级密度的空间信息：多级CNN框架
3. Tracking-by-Counting: Using Network Flows on Crowd Density Maps for Tracking Multiple Targets [J] . Weihong Ren, Xinchao Wang, Jiandong Tian, IEEE Transactions on Image Processing . 2021,第1期

机译：逐次计数：使用人群密度映射上的网络流程用于跟踪多个目标
4. Crowd Counting Via Multi-Level Regression With Latent Gaussian Maps [C] . Yukang Gao, Hua Yang IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：通过与潜在高斯地图的多级回归计数人群
5. Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo- EM Density Maps [D] . Moritz, Spencer. 2019

机译：深度学习根据高分辨率Cryo-EM密度图预测蛋白质骨架结构
6. Counting Crowds with Perspective Distortion Correction via Adaptive Learning [O] . Yixuan Sun, Jian Jin, Xingjiao Wu, 2020

机译：通过自适应学习计算具有透视扭曲校正的人群
7. Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks - Counting, Detection, and Tracking [O] . Kang, Di, Ma, Zheng, Chan, Antoni B. 2017

机译：超越计数：人群分析任务的密度图比较 - 计数，检测和跟踪

Learning Multi-Level Density Maps for Crowd Counting

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅