【24h】

A Comparison of Embedded Deep Learning Methods for Person Detection

机译:嵌入式深度学习方法对人检测的比较

获取原文
获取外文期刊封面目录资料

摘要

Recent advancements in parallel computing, GPU technology and deep learning provide a new platform for complex image processing tasks such as person detection to flourish. Person detection is fundamental preliminary operation for several high level computer vision tasks. One industry that can significantly benefit from person detection is retail. In recent years, various studies attempt to find an optimal solution for person detection using neural networks and deep learning. This study conducts a comparison among the state of the art deep learning base object detector with the focus on person detection performance in indoor environments. Performance of various implementations of YOLO, SSD, RCNN, R-FCN and SqueezeDet have been assessed using our in-house proprietary dataset which consists of over 10 thousands indoor images captured form shopping malls, retails and stores. Experimental results indicate that, Tiny YOLO-416 and SSD (VGG-300) are the fastest and Faster-RCNN (Inception ResNet-v2) and R-FCN (ResNet-101) are the most accurate detectors investigated in this study. Further analysis shows that YOLO v3-416 delivers relatively accurate result in a reasonable amount of time, which makes it an ideal model for person detection in embedded platforms.
机译:并行计算的最新进展,GPU技术和深度学习为复杂的图像处理任务提供了一种新的平台,例如人员检测蓬勃发展。人员检测是几个高级计算机视觉任务的基本初步操作。一个可以从人物检测中受益的一个行业是零售。近年来,各种研究试图使用神经网络和深度学习找到人员检测的最佳解决方案。本研究在艺术深度学习基础对象检测器的状态进行比较,其专注于室内环境中的人检测性能。使用我们的内部专有数据集进行了各种实现的yolo,SSD,RCNN,R-FCN和Screezedet的性能,该数据集由超过10数千万的室内图像捕获的代表商场,零售和商店。实验结果表明,微小的YOLO-416和SSD(VGG-300)是最快,RCNN(Inception Resnet-V2)和R-FCN(Resnet-101)是本研究中研究的最准确的探测器。进一步的分析表明,YOLO V3-416以合理的时间提供相对准确的结果,这使其成为嵌入式平台中的人员检测的理想模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号