Journal: IEEE Transactions on Industrial Informatics

Multiscale Multitask Deep NetVLAD for Crowd Counting


Abstract

Deep convolutional neural networks (CNNs) have become the de facto method for computer vision tasks, owing to their success in visual recognition on still images. However, their adaptations to crowd counting have not clearly established superiority over shallow models. Because of their shallow structure, existing CNNs for crowd counting turn out to be self-limiting in challenging scenarios such as changing camera illumination, partial occlusion, diverse crowd distributions, and perspective distortion. In this paper, we introduce a dynamic augmentation technique to train a much deeper CNN for crowd counting. To reduce the overfitting caused by the limited number of training samples, multitask learning is further employed to learn representations that generalize across similar domains. We also propose to aggregate multiscale convolutional features extracted from the entire image into a single compact vector representation, amenable to efficient and accurate counting, by way of the “Vector of Locally Aggregated Descriptors” (VLAD). A “deeply supervised” strategy is employed to provide an additional supervision signal to the bottom layers for further performance improvement. Experimental results on three benchmark crowd datasets show that our method achieves better performance than existing methods. Our implementation will be released at https://github.com/shizenglin/Multitask-Multiscale-Deep-NetVLAD.
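
The aggregation step mentioned in the abstract can be illustrated with a minimal NumPy sketch of NetVLAD-style soft-assignment pooling. This is not the authors' implementation: the function names (`netvlad_aggregate`, `flatten_feature_maps`), the number of clusters, and the random inputs are all hypothetical. It also assumes that feature maps from different scales share the same channel dimension, which the abstract does not state.

```python
import numpy as np

def flatten_feature_maps(feature_maps):
    """Stack local descriptors from several (H, W, D) feature maps into (N, D).

    Assumption: every scale has the same channel dimension D.
    """
    return np.concatenate(
        [f.reshape(-1, f.shape[-1]) for f in feature_maps], axis=0
    )

def netvlad_aggregate(descriptors, centers, weights, biases, eps=1e-12):
    """NetVLAD-style pooling of (N, D) descriptors into one (K * D,) vector.

    centers: (K, D) cluster centers; weights: (K, D) and biases: (K,)
    parameterize the soft assignment. All are learnable in a real network.
    """
    # Soft assignment of each descriptor to each cluster (row-wise softmax).
    logits = descriptors @ weights.T + biases            # (N, K)
    logits -= logits.max(axis=1, keepdims=True)          # numerical stability
    assign = np.exp(logits)
    assign /= assign.sum(axis=1, keepdims=True)

    # Weighted sum of residuals: V[k] = sum_i a[i, k] * (x[i] - c[k]).
    vlad = assign.T @ descriptors - assign.sum(axis=0)[:, None] * centers

    # Intra-normalization per cluster, then global L2 normalization.
    vlad /= np.linalg.norm(vlad, axis=1, keepdims=True) + eps
    vlad = vlad.ravel()
    return vlad / (np.linalg.norm(vlad) + eps)

# Hypothetical inputs: two scales of 16-channel features and K = 4 clusters.
rng = np.random.default_rng(0)
maps = [rng.standard_normal((8, 8, 16)), rng.standard_normal((4, 4, 16))]
K, D = 4, 16
centers = rng.standard_normal((K, D))
weights = rng.standard_normal((K, D))
biases = np.zeros(K)

vec = netvlad_aggregate(flatten_feature_maps(maps), centers, weights, biases)
print(vec.shape)
```

The output length is K * D regardless of how many spatial locations or scales contribute descriptors, which is what makes a single compact vector usable as input to a count regressor.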
