首页> 外文会议>IEEE International Symposium on Workload Characterization >Exploring GPGPU workloads: Characterization methodology, analysis and microarchitecture evaluation implications
【24h】

Exploring GPGPU workloads: Characterization methodology, analysis and microarchitecture evaluation implications

机译:探索GPGPU工作负载:表征方法,分析和微体系结构评估的意义

获取原文

摘要

The GPUs are emerging as a general-purpose high-performance computing device. Growing GPGPU research has made numerous GPGPU workloads available. However, a systematic approach to characterize these benchmarks and analyze their implication on GPU microarchitecture design evaluation is still lacking. In this research, we propose a set of microarchitecture agnostic GPGPU workload characteristics to represent them in a microarchitecture independent space. Correlated dimensionality reduction process and clustering analysis are used to understand these workloads. In addition, we propose a set of evaluation metrics to accurately evaluate the GPGPU design space. With growing number of GPGPU workloads, this approach of analysis provides meaningful, accurate and thorough simulation for a proposed GPU architecture design choice. Architects also benefit by choosing a set of workloads to stress their intended functional block of the GPU microarchitecture. We present a diversity analysis of GPU benchmark suites such as Nvidia CUDA SDK, Parboil and Rodinia. Our results show that with a large number of diverse kernels, workloads such as Similarity Score, Parallel Reduction, and Scan of Large Arrays show diverse characteristics in different workload spaces. We have also explored diversity in different workload subspaces (e.g. memory coalescing and branch divergence). Similarity Score, Scan of Large Arrays, MUMmerGPU, Hybrid Sort, and Nearest Neighbor workloads exhibit relatively large variation in branch divergence characteristics compared to others. Memory coalescing behavior is diverse in Scan of Large Arrays, K-Means, Similarity Score and Parallel Reduction.
机译:GPU逐渐成为一种通用的高性能计算设备。不断增长的GPGPU研究已使众多GPGPU工作负载可用。但是,仍然缺乏一种系统的方法来表征这些基准并分析它们对GPU微体系结构设计评估的影响。在这项研究中,我们提出了一组与微体系结构无关的GPGPU工作负载特征,以在独立于微体系结构的空间中表示它们。相关的降维过程和聚类分析用于了解这些工作负载。此外,我们提出了一组评估指标来准确评估GPGPU设计空间。随着越来越多的GPGPU工作负载,这种分析方法为建议的GPU架构设计选择提供了有意义,准确和彻底的仿真。架构师还可以通过选择一组工作负载来强调其预期的GPU微体系结构功能块,从而从中受益。我们对GPU基准套件(例如Nvidia CUDA SDK,Parboil和Rodinia)进行了多样性分析。我们的结果表明,在使用大量不同内核的情况下,诸如相似度评分,并行减少和大型阵列扫描之类的工作负载在不同的工作负载空间中显示出不同的特征。我们还探讨了不同工作负载子空间中的多样性(例如内存合并和分支分歧)。与其他相比,相似性评分,大型阵列扫描,MUMmerGPU,混合排序和最近邻居工作负载在分支发散特性方面表现出相对较大的变化。内存合并行为在大型阵列扫描,K均值,相似性得分和并行减少方面是多种多样的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号