首页> 外文会议>IEEE International Conference on Big Data and Smart Computing >OpenCL-Darknet: An OpenCL Implementation for Object Detection
【24h】

OpenCL-Darknet: An OpenCL Implementation for Object Detection

机译:OpenCL-Darknet:用于对象检测的OpenCL实现

获取原文

摘要

Object detection is a technology that deals with recognizing classes of objects and their location. It is used in many different areas, such as in face-detecting digital cameras, surveillance tools, or self-driving cars. These days, deep learning-based object detection approaches have achieved significantly better performance than the classic feature-based algorithms. Darknet [1] is a deep learning-based object detection framework, which is well known for its fast speed and simple structure. Unfortunately, like many other frameworks, Darknet only supports NVIDIA CUDA [2] for accelerating its calculations. For this reason, a user has only limited options for graphic card selection. OpenCL" (open computing language) [3] is an open standard for cross-platform, parallel programming of heterogeneous systems. It is available not only for CPUs, GPUs (graphics processing units), but also for DSPs (digital signal processors), FPGAs (field-programmable gate arrays) and other hardware accelerators. In this paper, we present the OpenCL-Darknet, which transforms the CUDA-based Darknet into an open standard OpenCL backend. Our goal was to implement a deep learning-based object detection framework that will be available for the general accelerator hardware and to achieve competitive performance compared to the original CUDA version. We evaluated the OpenCL-Darknet in AMD R7-integraged APU (accelerated processing unit) with OpenCL 2.0 and AMD Radeon RX560 with OpenCL 1.2 using a VOC 2007 dataset [4]. We also compared its performance with the original Darknet for NVIDIA GTX 1050 with CUDA 8.0 and cuDNN 6.0.
机译:对象检测是一种用于识别对象类别及其位置的技术。它被用于许多不同的领域,例如面部检测数码相机,监视工具或自动驾驶汽车。如今,基于深度学习的对象检测方法已经比基于经典特征的算法获得了明显更好的性能。 Darknet [1]是一个基于深度学习的对象检测框架,以其快速和简单的结构而闻名。不幸的是,与许多其他框架一样,Darknet仅支持NVIDIA CUDA [2]来加速其计算。因此,用户只有有限的图形卡选择选项。 “ OpenCL”(开放计算语言)[3]是用于异构系统的跨平台并行编程的开放标准。它不仅适用于CPU,GPU(图形处理单元),还适用于DSP(数字信号处理器), FPGA(现场可编程门阵列)和其他硬件加速器在本文中,我们介绍了OpenCL-Darknet,它将基于CUDA的Darknet转换为开放标准的OpenCL后端,我们的目标是实现基于深度学习的对象检测与原始CUDA版本相比,该框架可用于通用加速器硬件并获得竞争性能我们评估了使用OpenCL 2.0的AMD R7集成APU(加速处理单元)和使用OpenCL 1.2的AMD Radeon RX560的OpenCL-Darknet VOC 2007数据集[4]。我们还将其性能与带有CUDA 8.0和cuDNN 6.0的NVIDIA GTX 1050的原始Darknet进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号