Multi-Scale and spatial position-based channel attention network for crowd counting

Wang Lin; Li Jie; Zhang SiqiQi ChunWang PanWang Fengping

首页> 外文期刊>Journal of visual communication & image representation >Multi-Scale and spatial position-based channel attention network for crowd counting

【24h】

Multi-Scale and spatial position-based channel attention network for crowd counting

机译：基于空间位置的人群统计通道注意力网络

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相关主题

摘要

Crowd counting algorithms have recently incorporated attention mechanisms into convolutional neural networks (CNNs) to achieve significant progress. The channel attention model (CAM), as a popular attention mechanism, calculates a set of probability weights to select important channel-wise feature responses. However, most CAMs roughly assign a weight to the entire channel-wise map, which makes useful and useless information being treat indiscriminately, thereby limiting the representational capacity of networks. In this paper, we propose a multi -scale and spatial position-based channel attention network (MS-SPCANet), which integrates spatial position -based channel attention models (SPCAMs) with multiple scales into a CNN. SPCAM assigns different channel attention weights to different positions of channel-wise maps to capture more informative features. Furthermore, an adaptive loss, which uses adaptive coefficients to combine density map loss and headcount loss, is constructed to improve network performance in sparse crowd scenes. Experimental results on four public datasets verify the superiority of the scheme.

机译：人群计数算法最近将注意力机制整合到卷积神经网络（CNN）中，取得了重大进展。通道注意力模型（CAM）作为一种流行的注意力机制，计算一组概率权重来选择重要的通道特征响应。然而，大多数CAM粗略地为整个信道映射分配了权重，这使得有用和无用的信息被不加区别地处理，从而限制了网络的表示能力。在本文中，我们提出了一种多尺度和基于空间位置的信道注意力网络（MS-SPCANet），该网络将具有多个尺度的空间位置信道注意力模型（SPCAMs）集成到CNN中。SPCAM将不同的信道注意力权重分配给信道地图的不同位置，以捕获更多信息特征。此外，该文还构建了一种自适应损失，利用自适应系数将密度图损失和人数损失相结合，以提高稀疏人群场景下的网络性能。在4个公开数据集上的实验结果验证了该方案的优越性。

著录项

来源
《Journal of visual communication & image representation》 |2023年第2期|103718.1-103718.12|共12页
作者
Wang Lin; Li Jie; Zhang SiqiQi ChunWang PanWang Fengping;
展开▼
作者单位

Xi An Jiao Tong Univ;

Xian Modern Control Technol Res Inst;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
Crowd counting; Spatial position -based channel attention model; Multi -scale structure; Adaptive loss;

机译：人流统计;基于空间位置的通道注意力模型;多尺度结构;自适应损失;

Multi-Scale and spatial position-based channel attention network for crowd counting

摘要

著录项

引文网络

相关主题

期刊订阅