Conference: International Conference on Future Information Technology; International Conference on Multimedia and Ubiquitous Engineering

Optimal Design Parameters for Tiny-YOLO2 Implementation with Light-Weight Embedded GPGPU Environment


Abstract

The aim of this paper is to find the optimal design for Tiny-YOLO2 in an embedded system environment. Our focus is to rebuild the given YOLO2 code and to find the design parameters that maximize speed on the light-weight GPGPU of a target embedded platform. To maximize YOLO2 performance, we used the OpenCL framework on the embedded GPGPU and explored various OpenCL design parameters, such as the work-item and work-group configuration. This reduced the global memory access overhead and improved the balance of the computing load across compute units, under constraints that include local memory resources and computing resources. Our experimental results show an overall performance improvement of 18.2 times over the naive implementation.
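The kind of parameter search the abstract describes can be illustrated with a small host-side cost model. The sketch below is not the authors' method and is not OpenCL code: it assumes, purely for illustration, that a convolution layer is lowered to an M x K by K x N matrix product, that each work-group computes one output tile staged through local memory, and that global memory traffic is the quantity to minimize. The function names, the candidate tile sizes, and the resource limits are all hypothetical.

```python
from itertools import product


def global_loads(M, N, K, tile_m, tile_n):
    """Estimated element loads from global memory for a tiled product.

    Assumption: each work-group produces a tile_m x tile_n output tile and
    reads the matching tile_m rows of A and tile_n columns of B exactly once.
    """
    groups_m = -(-M // tile_m)  # ceiling division
    groups_n = -(-N // tile_n)
    return groups_m * groups_n * K * (tile_m + tile_n)


def local_bytes(tile_m, tile_n, tile_k, elem_size=4):
    """Local memory needed to hold one A sub-tile and one B sub-tile."""
    return (tile_m * tile_k + tile_k * tile_n) * elem_size


def best_tile(M, N, K, local_mem_limit, max_group_size, tile_k=8,
              candidates=(1, 2, 4, 8, 16, 32)):
    """Return (loads, tile_m, tile_n) with the fewest global loads that
    fits the local memory and work-group size limits, or None."""
    best = None
    for tm, tn in product(candidates, repeat=2):
        if tm * tn > max_group_size:
            continue  # one work-item per output element (assumption)
        if local_bytes(tm, tn, tile_k) > local_mem_limit:
            continue  # tiles would not fit in local memory
        cost = global_loads(M, N, K, tm, tn)
        if best is None or cost < best[0]:
            best = (cost, tm, tn)
    return best


if __name__ == "__main__":
    # Hypothetical device limits: 4 KiB local memory, 256 work-items per group.
    print(best_tile(64, 64, 64, local_mem_limit=4096, max_group_size=256))
```

Under this model, a 1 x 1 tile corresponds to a naive kernel in which every work-item fetches all of its operands from global memory. Tightening `local_mem_limit` forces smaller tiles and more global traffic, which is the trade-off between local memory resources and memory access overhead that the abstract refers to. Real tuning would also have to measure load balance across compute units on the target device, which this sketch does not model.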
