首页> 外文会议>International Conference on Innovative Mechanisms for Industry Applications >Prediction Model of an HPC Application on CPU-GPU Cluster using Machine Learning Techniques
【24h】

Prediction Model of an HPC Application on CPU-GPU Cluster using Machine Learning Techniques

机译:使用机器学习技术的CPU-GPU集群上HPC应用程序的预测模型

获取原文

摘要

In today's world hybrid computing cluster, is comprised of high-intensity computation central processing unit (CPU) and graphical processing unit (GPU) based nodes. In this article, a novel analytical prediction model by considering parameters such as the number of CPUs+GPUs cores, peripheral component interconnects express(PCI-E) bandwidth and CPU-GPU memory access bandwidth for varying data input sizes and recorded as historical data. A partial amount of data is tested to train our novel prediction model. Predicted execution time against actual execution time has been compared to enhance the accuracy of the model and reduce or remove any errors. The proposed prediction model is a major module that has been utilized from scheduling the strategy to scheduling the high-performance computing (HPC) application, which gives the least predicted execution time on the best resources of a heterogeneous cluster. The proposed predictive scheduling scheme with scheduling strategy has been tested by using the game of life benchmark applications on a CPU-GPUs cluster. The prediction model has been compared against machine learning techniques and it is observed that the proposed novel analytical prediction model has achieved less than 19% prediction error. The performance of our predictive scheduling scheme with other best existing schemes TORQUE has been compared, and it is also observed that the predictive scheduling scheme is 63% more efficient than the TORQUE.
机译:在当今世界的混合计算集群中,它由基于高强度计算的中央处理器(CPU)和图形处理单元(GPU)的节点组成。在本文中,通过考虑诸如CPU + GPU内核数,外围组件互连Express(PCI-E)带宽和CPU-GPU内存访问带宽之类的参数的新颖分析预测模型,以改变数据输入大小并记录为历史数据。测试了部分数据以训练我们新颖的预测模型。已将预测的执行时间与实际的执行时间进行了比较,以提高模型的准确性并减少或消除任何错误。所提出的预测模型是从调度策略到调度高性能计算(HPC)应用程序所使用的主要模块,该模型在异构集群的最佳资源上给出了最少的预测执行时间。通过使用CPU-GPU集群上的生命基准测试应用程序,对带有调度策略的预测调度方案进行了测试。将该预测模型与机器学习技术进行了比较,并且观察到所提出的新型分析预测模型已实现了小于19%的预测误差。我们将预测性调度方案与其他现有最佳方案TORQUE的性能进行了比较,并且还发现,预测性调度方案的效率比TORQUE高63%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号