首页> 外文会议>International Conference on Field Programmable Technology >An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
【24h】

An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA

机译:在FPGA上使用全流水线二值化CNN基于多尺度滑动窗口搜索的对象检测器

获取原文

摘要

An object detection problem consists of two problems: one is classification of detected object category and the other is localization. Frame object detection is used in an embedded vision systems, such as a robot, an automobile, a security camera, and a drone. These applications require high-performance computation and low-power consumption by an inexpensive device. This paper proposes multiscale sliding window based object detector using a fully pipelined binarized deep convolutional neural network (BCNN) on an FPGA. It consists of a sliding window part, a fully pipelined BCNN classifier, and an ARM processing unit for detection. Duplicate detections were filtered by using a non-maximum suppression algorithm running on the ARM processor. We propose the fully pipelined layers for the BCNN and its architecture for FPGA realization. Since the proposed BCNN circuit uses on-chip memories on the FPGA, its throughput is higher than a GPU based one with practical recognition accuracy. We trained the VGG11 based BCNN using the KITTI vision benchmark for the car detection scenario. Then, we implemented the proposed object detector on the Xilinx Inc. Zynq UltraScale+ MPSoC zcu102 evaluation board. The GPU based object detectors were too slow for the realtime application requirement (HD frame rate), with the exception of YOLOv2. As compared with the GPU implementation of YOLOv2, the proposed FPGA detector had higher recognition accuracy and lower power consumption. Compared with the YOLOv2, the proposed FPGA one is higher with respect to recognition accuracy, and its power consumption is lower than the GPU based YOLOv2. Thus, the FPGA based object detector suitable for the embedded realtime applications.
机译:对象检测问题包括两个问题:一个是检测到的对象类别的分类,另一个是定位。框架对象检测用于嵌入式视觉系统中,例如机器人,汽车,安全摄像机和无人机。这些应用要求通过廉价的设备进行高性能计算并降低功耗。本文提出了一种在FPGA上使用完全流水线的二值化深度卷积神经网络(BCNN)的基于多尺度滑动窗口的目标检测器。它由一个滑动窗口部分,一个完全流水线的BCNN分类器和一个用于检测的ARM处理单元组成。通过使用在ARM处理器上运行的非最大抑制算法来过滤重复的检测。我们为BCNN及其架构实现FPGA提出了全流水线层。由于建议的BCNN电路在FPGA上使用了片上存储器,因此其吞吐量要比具有实际识别精度的基于GPU的吞吐量高。我们使用KITTI视觉基准测试了基于VGG11的BCNN,以进行汽车检测。然后,我们在Xilinx Inc. Zynq UltraScale + MPSoC zcu102评估板上实现了拟议的目标检测器。除了YOLOv2之外,基于GPU的对象检测器对于实时应用要求(HD帧速率)来说太慢了。与YOLOv2的GPU实现相比,所提出的FPGA检测器具有更高的识别精度和更低的功耗。与YOLOv2相比,所提出的FPGA 1在识别精度方面更高,并且其功耗低于基于GPU的YOLOv2。因此,基于FPGA的对象检测器适用于嵌入式实时应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号