...
首页> 外文期刊>Journal of visual communication & image representation >Multi-Scale and spatial position-based channel attention network for crowd counting
【24h】

Multi-Scale and spatial position-based channel attention network for crowd counting

机译:基于空间位置的人群统计通道注意力网络

获取原文
获取原文并翻译 | 示例

摘要

Crowd counting algorithms have recently incorporated attention mechanisms into convolutional neural networks (CNNs) to achieve significant progress. The channel attention model (CAM), as a popular attention mechanism, calculates a set of probability weights to select important channel-wise feature responses. However, most CAMs roughly assign a weight to the entire channel-wise map, which makes useful and useless information being treat indiscriminately, thereby limiting the representational capacity of networks. In this paper, we propose a multi -scale and spatial position-based channel attention network (MS-SPCANet), which integrates spatial position -based channel attention models (SPCAMs) with multiple scales into a CNN. SPCAM assigns different channel attention weights to different positions of channel-wise maps to capture more informative features. Furthermore, an adaptive loss, which uses adaptive coefficients to combine density map loss and headcount loss, is constructed to improve network performance in sparse crowd scenes. Experimental results on four public datasets verify the superiority of the scheme.
机译:人群计数算法最近将注意力机制整合到卷积神经网络 (CNN) 中,取得了重大进展。通道注意力模型 (CAM) 作为一种流行的注意力机制,计算一组概率权重来选择重要的通道特征响应。然而,大多数CAM粗略地为整个信道映射分配了权重,这使得有用和无用的信息被不加区别地处理,从而限制了网络的表示能力。在本文中,我们提出了一种多尺度和基于空间位置的信道注意力网络(MS-SPCANet),该网络将具有多个尺度的空间位置信道注意力模型(SPCAMs)集成到CNN中。SPCAM将不同的信道注意力权重分配给信道地图的不同位置,以捕获更多信息特征。此外,该文还构建了一种自适应损失,利用自适应系数将密度图损失和人数损失相结合,以提高稀疏人群场景下的网络性能。在4个公开数据集上的实验结果验证了该方案的优越性。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号