首页> 外文会议>International symposium on search based software engineering >Amortised Deep Parameter Optimisation of GPGPU Work Group Size for OpenCV
【24h】

Amortised Deep Parameter Optimisation of GPGPU Work Group Size for OpenCV

机译:OpenCV的GPGPU工作组大小的分期摊销的深度参数优化

获取原文

摘要

GPGPU (General Purpose computing on Graphics Processing Units) enables massive parallelism by taking advantage of the Single Instruction Multiple Data (SIMD) architecture of the large number of cores found on modern graphics cards. A parameter called local work group size controls how many work items are concurrently executed on a single compute unit. Though critical to the performance, there is no deterministic way to tune it, leaving developers to manual trial and error. This paper applies amortised optimisation to determine the best local work group size for GPGPU implementations of OpenCV template matching feature. The empirical evaluation shows that optimised local work group size can outperform the default value with large effect sizes.
机译:GPGPU(图形处理单元上的通用计算)通过利用现代图形卡上大量内核的单指令多数据(SIMD)架构,实现了大规模并行处理。称为本地工作组大小的参数控制在单个计算单元上同时执行多少个工作项。尽管对性能至关重要,但没有确定性的方法可以对其进行调整,从而使开发人员不得不手动进行试验和出错。本文应用摊销优化来确定OpenCV模板匹配功能的GPGPU实施的最佳本地工作组大小。实证评估表明,优化的本地工作组规模可以在效果较大的情况下胜过默认值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号