首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing >A Data-Driven Frequency Scaling Approach for Deadline-aware Energy Efficient Scheduling on Graphics Processing Units (GPUs)
【24h】

A Data-Driven Frequency Scaling Approach for Deadline-aware Energy Efficient Scheduling on Graphics Processing Units (GPUs)

机译:在图形处理单元(GPU)上了解截止日期的节能调度的数据驱动频率缩放方法

获取原文

摘要

Modern computing paradigms, such as cloud computing, are increasingly adopting GPUs to boost their computing capabilities primarily due to the heterogeneous nature of AI/ML/deep learning workloads. However, the energy consumption of GPUs is a critical problem. Dynamic Voltage Frequency Scaling (DVFS) is a widely used technique to reduce the dynamic power of GPUs. Yet, configuring the optimal clock frequency for essential performance requirements is a non-trivial task due to the complex nonlinear relationship between the application’s runtime performance characteristics, energy, and execution time. It becomes more challenging when different applications behave distinctively with similar clock settings. Simple analytical solutions and standard GPU frequency scaling heuristics fail to capture these intricacies and scale the frequencies appropriately. In this regard, we propose a data-driven frequency scaling technique by predicting the power and execution time of a given application over different clock settings. We collect the data from application profiling and train the models to predict the outcome accurately. The proposed solution is generic and can be easily extended to different kinds of workloads and GPU architectures. Furthermore, using this frequency scaling by prediction models, we present a deadline-aware application scheduling algorithm to reduce energy consumption while simultaneously meeting their deadlines. We conduct real extensive experiments on NVIDIA GPUs using several benchmark applications. The experiment results have shown that our prediction models have high accuracy with the average RMSE values of 0.38 and 0.05 for energy and time prediction, respectively. Also, the scheduling algorithm consumes 15.07% less energy as compared to the baseline policies.
机译:诸如云计算之类的现代计算范例越来越多地采用GPU来增强其计算能力,这主要是由于AI / ML /深度学习工作负载的异构性质。但是,GPU的能耗是一个关键问题。动态电压频率缩放(DVFS)是一种广泛用于降低GPU动态功耗的技术。然而,由于应用程序的运行时性能特征,能量和执行时间之间存在复杂的非线性关系,因此为满足基本性能要求而配置最佳时钟频率并非易事。当不同的应用程序在相似的时钟设置下表现出独特的性能时,它将变得更具挑战性。简单的分析解决方案和标准的GPU频率缩放试探法无法捕获这些复杂性,无法适当地缩放频率。在这方面,我们通过预测给定应用程序在不同时钟设置下的功耗和执行时间,提出了一种数据驱动的频率缩放技术。我们从应用程序分析收集数据,并训练模型以准确预测结果。提出的解决方案是通用的,可以轻松扩展到不同种类的工作负载和GPU架构。此外,使用预测模型的频率缩放比例,我们提出了一种可感知截止日期的应用程序调度算法,以减少能耗,同时满足截止日期。我们使用多个基准测试应用程序在NVIDIA GPU上进行了真正的广泛实验。实验结果表明,我们的预测模型具有很高的准确性,能量和时间预测的平均RMSE值分别为0.38和0.05。此外,与基准策略相比,调度算法消耗的能源少15.07%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号