首页> 外文期刊>Journal of Parallel and Distributed Computing >CUIRRE: An open-source library for load balancing and characterizing irregular applications on GPUs
【24h】

CUIRRE: An open-source library for load balancing and characterizing irregular applications on GPUs

机译:CUIRRE:一个开源库,用于负载均衡和表征GPU上的不规则应用程序

获取原文
获取原文并翻译 | 示例

摘要

While Graphics Processing Units (GPUs) show high performance for problems with regular structures, they do not perform well for irregular tasks due to the mismatches between irregular problem structures and SIMD-like GPU architectures. In this paper, we introduce a new library, CUIRRE, for improving performance of irregular applications on GPUs. CUIRRE reduces the load imbalance of GPU threads resulting from irregular loop structures. In addition, CUIRRE can characterize irregular applications for their irregularity, thread granularity and GPU utilization. We employ this library to characterize and optimize both synthetic and real-world applications. The experimental results show that a 1.63 × on average and up to 2.76 × performance improvement can be achieved with the centralized task pool approach in the library at a 4.57% average overhead with static loading ratios. To avoid the cost of exhaustive searches of loading ratios, an adaptive loading ratio method is proposed to derive appropriate loading ratios for different inputs automatically at runtime. Our task pool approach outperforms other load balancing schemes such as the task stealing method and the persistent threads method. The CUIRRE library can easily be applied on many other irregular problems.
机译:尽管图形处理单元(GPU)对于常规结构的问题显示出高性能,但由于非常规问题结构和类似于SIMD的GPU架构之间的不匹配,它们在非常规任务中的表现不佳。在本文中,我们引入了一个新的库CUIRRE,用于提高GPU上非常规应用程序的性能。 CUIRRE减少了由不规则循环结构导致的GPU线程的负载不平衡。此外,CUIRRE可以针对不规则应用程序的不规则性,线程粒度和GPU利用率进行特征描述。我们使用该库来表征和优化合成应用程序和实际应用程序。实验结果表明,使用库中的集中式任务池方法,在平均负载为4.57%且具有静态负载率的情况下,可以平均提高1.63×的性能,并提高性能。为了避免穷举搜索负载率的成本,提出了一种自适应负载率方法,可以在运行时自动为不同的输入导出合适的负载率。我们的任务池方法优于其他负载平衡方案,例如任务窃取方法和持久线程方法。 CUIRRE库可轻松应用于许多其他不规则问题。

著录项

  • 来源
    《Journal of Parallel and Distributed Computing》 |2014年第10期|2951-2966|共16页
  • 作者

    Tao Zhang; Wei Shu; Min-You Wu;

  • 作者单位

    Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai 200240, China,Department of Electrical & Computer Engineering, The University of New Mexico, Albuquerque 87131-0001, NM, USA, 3-121 SEIEE Building, Shanghai Jiao Tong University, 800 Dong Chuan Road, Min Hang District, Shanghai 200240, China;

    Department of Electrical & Computer Engineering, The University of New Mexico, Albuquerque 87131-0001, NM, USA Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;

    Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Load balancing; Characterizing; Library; Irregular; CPU;

    机译:负载均衡;表征图书馆;不规则;中央处理器;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号