An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA

机译：基于多尺度滑动窗口搜索的对象检测器在FPGA上使用完全流水线二值化CNN进行搜索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An object detection problem consists of two problems: one is classification of detected object category and the other is localization. Frame object detection is used in an embedded vision systems, such as a robot, an automobile, a security camera, and a drone. These applications require high-performance computation and low-power consumption by an inexpensive device. This paper proposes multiscale sliding window based object detector using a fully pipelined binarized deep convolutional neural network (BCNN) on an FPGA. It consists of a sliding window part, a fully pipelined BCNN classifier, and an ARM processing unit for detection. Duplicate detections were filtered by using a non-maximum suppression algorithm running on the ARM processor. We propose the fully pipelined layers for the BCNN and its architecture for FPGA realization. Since the proposed BCNN circuit uses on-chip memories on the FPGA, its throughput is higher than a GPU based one with practical recognition accuracy. We trained the VGG11 based BCNN using the KITTI vision benchmark for the car detection scenario. Then, we implemented the proposed object detector on the Xilinx Inc. Zynq UltraScale+ MPSoC zcu102 evaluation board. The GPU based object detectors were too slow for the realtime application requirement (HD frame rate), with the exception of YOLOv2. As compared with the GPU implementation of YOLOv2, the proposed FPGA detector had higher recognition accuracy and lower power consumption. Compared with the YOLOv2, the proposed FPGA one is higher with respect to recognition accuracy, and its power consumption is lower than the GPU based YOLOv2. Thus, the FPGA based object detector suitable for the embedded realtime applications.

机译：对象检测问题由两个问题组成：一个是检测到的对象类别的分类，另一个是本地化。帧对象检测用于嵌入式视觉系统，例如机器人，汽车，安全摄像机和无人机。这些应用需要通过廉价的设备进行高性能计算和低功耗。本文提出了在FPGA上使用完全流水线二值化的深卷积神经网络（BCNN）的基于多尺度滑动窗口的物体检测器。它由滑动窗口部分，完全流水线的BCNN分类器和用于检测的臂处理单元组成。通过使用在ARM处理器上运行的非最大抑制算法来滤波重复检测。我们为BCNN提供了全流水线层及其用于FPGA实现的架构。由于所提出的BCNN电路在FPGA上使用片上存储器，因此其吞吐量高于GPU，其具有实际识别精度。我们使用基于VGG11的BCNN使用Kitti Vision基准测试汽车检测方案进行了培训。然后，我们在Xilinx Inc. Zynq UltraScale + MPSoC ZCU102评估板上实现了所提出的对象探测器。除了YOLOV2之外，基于GPU的对象检测器对于实时应用要求（HD帧速率）来说太慢了。与yolov2的GPU实施相比，所提出的FPGA检测器具有更高的识别精度和更低的功耗。与YOLOV2相比，所提出的FPGA相对于识别精度更高，其功耗低于基于GPU的YOLOV2。因此，基于FPGA的对象检测器适用于嵌入的实时应用。

著录项

来源
《International Conference on Field Programmable Technology》|2017年|302p|共8页
会议地点
作者
Hiroki Nakahara; Haruyoshi Yonekawa; Shimpei Sato;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP332.3-53;
关键词
Field programmable gate arrays; Detectors; Graphics processing units; Microsoft Windows; Two dimensional displays; Object detection; Proposals;

机译：现场可编程门阵列;探测器;图形处理单元;Microsoft Windows;二维显示器;对象检测;提案;

相似文献

外文文献
中文文献
专利

1. XNORCONV: CNNs accelerator implemented on FPGA using a hybrid CNNs structure and an inter-layer pipeline method [J] . Image Processing, IET . 2020,第1期

机译：XNORCONV：使用混合CNN结构和层间流水线方法在FPGA上实现的CNN加速器
2. Toward an Efficient Deep Pipelined Template-Based Architecture for Accelerating the Entire 2-D and 3-D CNNs on FPGA [J] . Shen Junzhong, Huang You, Wen Mei, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems . 2020,第7期

机译：朝着高效的基于流水线模板的架构，用于加速FPGA上的整个2-D和3-D CNN
3. A New Volumetric CNN for 3D Object Classification Based on Joint Multiscale Feature and Subvolume Supervised Learning Approaches [J] . A. A. M. Muzahid, Wanggen Wan, Li Hou Computational intelligence and neuroscience . 2020,第4期

机译：基于联合多尺度特征的3D对象分类的新体积CNN，Supvolume监督学习方法
4. An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA [C] . Hiroki Nakahara, Haruyoshi Yonekawa, Shimpei Sato International Conference on Field Programmable Technology . 2017

机译：在FPGA上使用全流水线二值化CNN基于多尺度滑动窗口搜索的对象检测器
5. Comparative Study of Feature-Selective Sliding Window Object Detectors in Images. [D] . Waghmare, Sagar Manohar. 2012

机译：图像中特征选择滑动窗口目标检测器的比较研究。
6. A Pipelined Non-Deterministic Finite Automaton-Based String Matching Scheme Using Merged State Transitions in an FPGA [O] . HyunJin Kim, Kang-Il Choi -1

机译：在FPGA中使用合并状态转换的基于流水线的不确定自动机字符串匹配方案
7. Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors [O] . Duy Thanh Nguyen, Hyun Kim, Hyuk-Jae Lee 2021

机译：用于基于CNN的对象检测器的FPGA设计中与FPGA设计中混合数据流的层特异性优化

An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA

摘要

著录项

相似文献

相关主题

期刊订阅