A Data-Driven Frequency Scaling Approach for Deadline-aware Energy Efficient Scheduling on Graphics Processing Units (GPUs)

Abstract

Modern computing paradigms, such as cloud computing, are increasingly adopting GPUs to boost their computing capabilities, primarily because of the heterogeneous nature of AI/ML/deep learning workloads. However, the energy consumption of GPUs is a critical problem. Dynamic Voltage and Frequency Scaling (DVFS) is a widely used technique to reduce the dynamic power of GPUs. Yet configuring the optimal clock frequency for essential performance requirements is a non-trivial task because of the complex nonlinear relationship between an application's runtime performance characteristics, energy, and execution time. The task becomes more challenging because different applications behave differently under similar clock settings. Simple analytical solutions and standard GPU frequency scaling heuristics fail to capture these intricacies and to scale the frequencies appropriately. We therefore propose a data-driven frequency scaling technique that predicts the power and execution time of a given application over different clock settings. We collect data from application profiling and train models to predict these outcomes accurately. The proposed solution is generic and can be easily extended to different kinds of workloads and GPU architectures. Furthermore, using this prediction-based frequency scaling, we present a deadline-aware application scheduling algorithm that reduces energy consumption while meeting application deadlines. We conduct extensive experiments on real NVIDIA GPUs using several benchmark applications. The results show that our prediction models are highly accurate, with average RMSE values of 0.38 and 0.05 for energy and time prediction, respectively. The scheduling algorithm also consumes 15.07% less energy than the baseline policies.
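
The abstract does not give the scheduling algorithm itself, so the following is only a minimal sketch of the general idea it describes: given predictors of execution time and power at each clock setting, pick the setting with the lowest predicted energy among those that still meet the deadline. Everything named here (`pick_clock`, `ClockChoice`, the toy predictors `toy_time_s` and `toy_power_w`, and the frequency list `FREQS_MHZ`) is a hypothetical illustration, not the authors' models, data, or algorithm.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class ClockChoice:
    freq_mhz: int          # selected clock frequency
    pred_time_s: float     # predicted execution time at that frequency
    pred_energy_j: float   # predicted energy = predicted power * predicted time


def pick_clock(
    freqs_mhz: Iterable[int],
    predict_time_s: Callable[[int], float],
    predict_power_w: Callable[[int], float],
    deadline_s: float,
) -> Optional[ClockChoice]:
    """Return the lowest-energy clock whose predicted time meets the deadline.

    Returns None when no candidate frequency is predicted to meet the deadline.
    """
    best: Optional[ClockChoice] = None
    for f in freqs_mhz:
        t = predict_time_s(f)
        if t > deadline_s:
            continue  # predicted to miss the deadline, so not a candidate
        e = predict_power_w(f) * t
        if best is None or e < best.pred_energy_j:
            best = ClockChoice(f, t, e)
    return best


# Toy stand-ins for trained prediction models (assumed shapes, not measured data):
# time falls as the clock rises, power grows faster than linearly with the clock.
def toy_time_s(freq_mhz: int) -> float:
    return 1200.0 / freq_mhz


def toy_power_w(freq_mhz: int) -> float:
    return 40.0 + 1.0e-7 * freq_mhz ** 3


FREQS_MHZ = [600, 900, 1200, 1500]  # hypothetical supported clock settings

if __name__ == "__main__":
    print(pick_clock(FREQS_MHZ, toy_time_s, toy_power_w, deadline_s=1.5))
```

In this toy setup a looser deadline lets the selector drop to a lower clock, while a deadline that no setting can meet yields `None`, which a real scheduler would have to handle separately (for example by rejecting or deferring the job).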

Bibliographic details

Source
IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing | 2020 | pp. 579-588 | 10 pages
Authors
Shashikant Ilager; Rajeev Muralidhar; Kotagiri Rammohanrao; Rajkumar Buyya;
Original format: PDF
Keywords
GPU; Energy; Data-Driven; Scheduling; Machine Learning;
