首页> 外文会议>International Symposium on Applied Reconfigurable Computing >A Parameterisable FPGA-Tailored Architecture for YOLOv3-Tiny
【24h】

A Parameterisable FPGA-Tailored Architecture for YOLOv3-Tiny

机译:用于yolov3-tiny的参数可见的fpga-tared架构

获取原文

摘要

Object detection is the task of detecting the position of objects in an image or video as well as their corresponding class. The current state of the art approach that achieves the highest performance (i.e. fps) without significant penalty in accuracy of detection is the YOLO framework, and more specifically its latest version YOLOv3. When embedded systems are targeted for deployment, YOLOv3-tiny, a lightweight version of YOLOv3, is usually adopted. The presented work is the first to implement a parameterised FPGA-tailored architecture specifically for YOLOv3-tiny. The architecture is optimised for latency-sensitive applications, and is able to be deployed in low-end devices with stringent resource constraints. Experiments demonstrate that when a low-end FPGA device is targeted, the proposed architecture achieves a 290x improvement in latency, compared to the hard core processor of the device, achieving at the same time a reduction in mAP of 2.5 pp (30.9% vs 33.4%) compared to the original model. The presented work opens the way for low-latency object detection on low-end FPGA devices.
机译:对象检测是检测图像或视频中对象位置以及它们相应的类的任务。目前的现有方法是实现最高性能(即FPS)而无需在检测准确性的显着惩罚的情况下实现的是YOLO框架,更具体地是其最新版本的YOLOV3。当嵌入式系统针对部署时,通常采用yolov3-tiny,轻量级版本的yolov3。所呈现的工作是第一个实施专门为YOLOV3-TINY实现参数化的FPGA定制架构。该体系结构针对延迟敏感的应用程序进行了优化,并且能够在具有严格资源约束的低端设备中部署。实验表明,当目标低端FPGA器件时,拟议的体系结构达到延迟的290倍改善,与设备的硬核处理器相比,同时实现了2.5 pp的地图的相同时间(30.9%Vs 33.4 %)与原始模型相比。所呈现的工作为低端FPGA设备上的低延迟对象检测开辟了方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号