...
首页> 外文期刊>Journal of Theoretical and Applied Information Technology >STATIC MAPPING FOR OPENCL WORKLOADS IN HETEROGENEOUS COMPUTER SYSTEMS
【24h】

STATIC MAPPING FOR OPENCL WORKLOADS IN HETEROGENEOUS COMPUTER SYSTEMS

机译:异构计算机系统中OpenCL工作负载的静态映射

获取原文

摘要

Today, heterogeneous computer systems (HCS) commonly rely on CPU and GPU, for processing elements, and OpenCL, for the programming framework. In an HCS, a workload should execute on its best processor to achieve its best speedup. OpenCL currently entirely lefts the selection for the best-fit processor, termed as workload mapping, to programmers. However, the NP-completeness of the workload mapping task indicates it is not a trivial task to do manually by programmers so that effective computational approaches are necessary. This research proposes a static mapping method for OpenCL workloads that automatically select the best-fit processor for the workloads. The method accepts static features of a workload and utilizes K-Nearest Neighbor algorithm to classify the workload to either CPU or GPU. The static features are collected using LLVM/Clang compiler framework. To increase the accuracy of classification while keep maintaining the physical meaning of features, the features are reduced using feature selection approaches. Two feature selection models, filter model and wrapper model, are used in this research. This approach was evaluated using k-fold cross-validation against 18 OpenCL kernels obtained from standard benchmark packages. According to the evaluation results, the workload mapping accuracy was in the range of 93% to 97% indicating the method is well applicable in the HC environment with two processors. Floating-point operations and vector-integer operations, or floating-point operations and vector-global memory access are the combinations of features that a have significant contribution to the classification of workloads. The main contribution of the method in this research, compared to previous related research, lies in its capability to state features that are significant in the classification process.
机译:如今,异构计算机系统(HCS)通常依赖于CPU和GPU,用于处理元件和OpenCL,用于编程框架。在HCS中,工作负载应在其最佳处理器上执行以实现最佳加速。 OpenCL目前完全留下了最适合处理器的选择,称为Workload Mapping,对程序员。但是,工作负载映射任务的NP完整性表示不需要编程器手动执行的琐碎任务,以便需要有效的计算方法。本研究提出了一种用于OpenCL工作负载的静态映射方法,可自动为工作负载选择最适合的处理器。该方法接受工作负载的静态特征,并利用K-Collect Neible算法将工作负载分类为CPU或GPU。使用LLVM / CLANG编译器框架收集静态功能。为了提高分类的准确性,同时保持特征的特征的物理含义,使用特征选择方法减少了特征。本研究使用了两个特征选择模型,过滤器模型和包装型号。使用k折叠交叉验证对从标准基准封装获得的18个OpenCL内核进行评估。根据评估结果,工作负载映射精度在93%至97%的范围内,表示该方法适用于HC环境,具有两个处理器。浮点操作和矢量整数操作,或浮点操作和矢量全局内存访问是具有对工作负载分类具有重要贡献的功能的组合。与以前的相关研究相比,该研究中该研究的主要贡献在于其对分类过程中具有重要意义的状态特征的能力。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号