A Parameterisable FPGA-Tailored Architecture for YOLOv3-Tiny

机译：用于yolov3-tiny的参数可见的fpga-tared架构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Object detection is the task of detecting the position of objects in an image or video as well as their corresponding class. The current state of the art approach that achieves the highest performance (i.e. fps) without significant penalty in accuracy of detection is the YOLO framework, and more specifically its latest version YOLOv3. When embedded systems are targeted for deployment, YOLOv3-tiny, a lightweight version of YOLOv3, is usually adopted. The presented work is the first to implement a parameterised FPGA-tailored architecture specifically for YOLOv3-tiny. The architecture is optimised for latency-sensitive applications, and is able to be deployed in low-end devices with stringent resource constraints. Experiments demonstrate that when a low-end FPGA device is targeted, the proposed architecture achieves a 290x improvement in latency, compared to the hard core processor of the device, achieving at the same time a reduction in mAP of 2.5 pp (30.9% vs 33.4%) compared to the original model. The presented work opens the way for low-latency object detection on low-end FPGA devices.

机译：对象检测是检测图像或视频中对象位置以及它们相应的类的任务。目前的现有方法是实现最高性能（即FPS）而无需在检测准确性的显着惩罚的情况下实现的是YOLO框架，更具体地是其最新版本的YOLOV3。当嵌入式系统针对部署时，通常采用yolov3-tiny，轻量级版本的yolov3。所呈现的工作是第一个实施专门为YOLOV3-TINY实现参数化的FPGA定制架构。该体系结构针对延迟敏感的应用程序进行了优化，并且能够在具有严格资源约束的低端设备中部署。实验表明，当目标低端FPGA器件时，拟议的体系结构达到延迟的290倍改善，与设备的硬核处理器相比，同时实现了2.5 pp的地图的相同时间（30.9％Vs 33.4 ％）与原始模型相比。所呈现的工作为低端FPGA设备上的低延迟对象检测开辟了方法。

著录项

来源
《International Symposium on Applied Reconfigurable Computing》|2020年|330-344|共15页
会议地点
作者
Zhewen Yu; Christos-Savvas Bouganis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
YOLOv3-tiny; FPGA; Object detection;

机译：yolov3-tiny;FPGA;对象检测;
入库时间 2022-08-26 13:55:54

相似文献

外文文献
中文文献
专利

1. Compact and high-throughput parameterisable architectures for memory-based FFT algorithms [J] . Valencia Daniel, Alimohammad Amirhossein Circuits, Devices & Systems, IET . 2019,第5期

机译：紧凑且高通量的可参数化架构，用于基于存储器的FFT算法
2. Customised soft processor design: a compromise between architecture description languages and parameterisable processors [J] . Vakili, S., Langlois, Computers & Digital Techniques, IET . 2013,第3期

机译：定制的软处理器设计：架构描述语言和可设置参数的处理器之间的折衷
3. An improved YOLOv3-tiny method for fire detection in the construction industry [J] . Jichao Li, Shengyu Guo, Liulin Kong, E3S Web of Conferences . 2021,第a期

机译：建筑业火灾探测的改进yolov3-tiny方法
4. Quantize YOLOv3-tiny For 5-bit Hardware [C] . Yang Hua, Lixin Yu, Xiao Meng, International Conference on Advanced Electronic Materials, Computers and Software Engineering . 2021

机译：量化Yolov3-Tiny对于5位硬件
5. Decoding Chinese classical architecture for contemporary architectural design, with special reference to modern architectural development in Taiwan [D] . Sung, Li-wen 2006

机译：解码中国古典建筑以进行当代建筑设计，并特别参考台湾的现代建筑发展
6. Impact of Coronary Stent Architecture on Clinical Outcomes: Do Minor Changes in Stent Architecture Really Matter? [O] . Amin Ariff Bin Nuruddin, Wan Azman Wan Ahmad, Matthias Waliszewski, 2021

机译：冠状动脉支架建筑对临床结果的影响：支架建筑的微小变化是否真的很重要？
7. A Parameterisable and Scalable SmithWaterman Algorithm Implementation on CUDAcompatible GPUs [O] . Cheng Ling, Khaled Benkrid, Tsuyoshi Hamada 2009

机译：CUDa兼容GpU上的可参数化和可扩展的smithWaterman算法实现

A Parameterisable FPGA-Tailored Architecture for YOLOv3-Tiny

摘要

著录项

相似文献

相关主题

期刊订阅