首页> 外文会议>International Conference on Field Programmable Technology >An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
【24h】

An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA

机译:基于多尺度滑动窗口搜索的对象检测器在FPGA上使用完全流水线二值化CNN进行搜索

获取原文

摘要

An object detection problem consists of two problems: one is classification of detected object category and the other is localization. Frame object detection is used in an embedded vision systems, such as a robot, an automobile, a security camera, and a drone. These applications require high-performance computation and low-power consumption by an inexpensive device. This paper proposes multiscale sliding window based object detector using a fully pipelined binarized deep convolutional neural network (BCNN) on an FPGA. It consists of a sliding window part, a fully pipelined BCNN classifier, and an ARM processing unit for detection. Duplicate detections were filtered by using a non-maximum suppression algorithm running on the ARM processor. We propose the fully pipelined layers for the BCNN and its architecture for FPGA realization. Since the proposed BCNN circuit uses on-chip memories on the FPGA, its throughput is higher than a GPU based one with practical recognition accuracy. We trained the VGG11 based BCNN using the KITTI vision benchmark for the car detection scenario. Then, we implemented the proposed object detector on the Xilinx Inc. Zynq UltraScale+ MPSoC zcu102 evaluation board. The GPU based object detectors were too slow for the realtime application requirement (HD frame rate), with the exception of YOLOv2. As compared with the GPU implementation of YOLOv2, the proposed FPGA detector had higher recognition accuracy and lower power consumption. Compared with the YOLOv2, the proposed FPGA one is higher with respect to recognition accuracy, and its power consumption is lower than the GPU based YOLOv2. Thus, the FPGA based object detector suitable for the embedded realtime applications.
机译:对象检测问题由两个问题组成:一个是检测到的对象类别的分类,另一个是本地化。帧对象检测用于嵌入式视觉系统,例如机器人,汽车,安全摄像机和无人机。这些应用需要通过廉价的设备进行高性能计算和低功耗。本文提出了在FPGA上使用完全流水线二值化的深卷积神经网络(BCNN)的基于多尺度滑动窗口的物体检测器。它由滑动窗口部分,完全流水线的BCNN分类器和用于检测的臂处理单元组成。通过使用在ARM处理器上运行的非最大抑制算法来滤波重复检测。我们为BCNN提供了全流水线层及其用于FPGA实现的架构。由于所提出的BCNN电路在FPGA上使用片上存储器,因此其吞吐量高于GPU,其具有实际识别精度。我们使用基于VGG11的BCNN使用Kitti Vision基准测试汽车检测方案进行了培训。然后,我们在Xilinx Inc. Zynq UltraScale + MPSoC ZCU102评估板上实现了所提出的对象探测器。除了YOLOV2之外,基于GPU的对象检测器对于实时应用要求(HD帧速率)来说太慢了。与yolov2的GPU实施相比,所提出的FPGA检测器具有更高的识别精度和更低的功耗。与YOLOV2相比,所提出的FPGA相对于识别精度更高,其功耗低于基于GPU的YOLOV2。因此,基于FPGA的对象检测器适用于嵌入的实时应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号