首页> 外文期刊>Frontiers of computer science in China >Crowd counting via learning perspective for multi-scale multi-view Web images
【24h】

Crowd counting via learning perspective for multi-scale multi-view Web images

机译:通过学习角度进行人群计数,以实现多尺度多视图Web图像

获取原文
获取原文并翻译 | 示例
           

摘要

Estimating the number of people in Web images still remains a challenging problem owing to the perspective variation, different views, and diverse backgrounds. Existing deep learning models still have difficulties in dealing with scenarios where the size of a person is either extremely large or extremely small. In this paper, we propose a novel perspective-aware architecture to estimate the number of people in a crowd in web images. Specifically, we use a two-stage framework, where we first learn a policy network to infer the perspective of the target scene, which outputs a scale label for the subsequent perspective normalization. Next, given the aligned inputs, we further adjust the scale-specific counting network to regress the final count. Experiments on challenging datasets demonstrate our approach can deal with a large perspective variation and that we have achieved state-of-theart results.
机译:由于视角的变化,不同的观点和不同的背景,估计Web图像中的人数仍然是一个具有挑战性的问题。现有的深度学习模型在处理人的大小非常大或非常小的场景时仍然存在困难。在本文中,我们提出了一种新颖的透视感知架构,以估计Web图像中人群的人数。具体来说,我们使用一个两阶段的框架,在该框架中,我们首先学习一个策略网络来推断目标场景的视角,该网络会为随后的视角标准化输出比例标签。接下来,给定对齐的输入,我们进一步调整特定于规模的计数网络以回归最终计数。在具有挑战性的数据集上进行的实验表明,我们的方法可以处理较大的视角变化,并且已经取得了最新的成果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号