Concurrency and Computation: Practice and Experience

Deep and reinforcement learning for automated task scheduling in large-scale cloud computing systems


Abstract

Cloud computing is undeniably becoming the main computing and storage platform for today's major workloads. From Internet of Things and Industry 4.0 workloads to big data analytics and decision-making jobs, cloud systems daily receive a massive number of tasks that need to be simultaneously and efficiently mapped onto cloud resources. Therefore, deriving an appropriate task scheduling mechanism that can minimize both tasks' execution delay and cloud resource consumption is of prime importance. Recently, the concept of cloud automation has emerged to reduce manual intervention and improve resource management in large-scale cloud computing workloads. In this article, we capitalize on this concept and propose four deep and reinforcement learning-based scheduling approaches to automate the process of scheduling large-scale workloads onto cloud computing resources, while reducing both the resource consumption and the task waiting time. These approaches are: reinforcement learning (RL), deep Q networks (DQN), recurrent neural networks with long short-term memory (RNN-LSTM), and deep reinforcement learning combined with LSTM (DRL-LSTM). Experiments conducted using real-world datasets from the Google Cloud Platform revealed that DRL-LSTM outperforms the other three approaches. The experiments also showed that DRL-LSTM reduces the CPU usage cost by up to 67% compared with shortest job first (SJF), and by up to 35% compared with both round robin (RR) and improved particle swarm optimization (PSO). Moreover, our DRL-LSTM solution decreases the RAM usage cost by up to 72% compared with SJF, by up to 65% compared with RR, and by up to 31.25% compared with the improved PSO.
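To make the RL-based scheduling idea concrete, the sketch below shows a minimal tabular Q-learning agent that assigns incoming tasks to virtual machines so as to reduce task waiting time. This is an illustrative toy, not the paper's actual formulation: the state encoding (coarse per-VM load buckets), the reward (negative waiting time), and all constants are hypothetical simplifications.

```python
import random

# Hypothetical sketch: tabular Q-learning for task-to-VM scheduling.
# State, action, and reward definitions are illustrative only.

NUM_VMS = 3
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration

# Q-table keyed by (state, action); state = tuple of per-VM load buckets,
# action = index of the VM the task is assigned to.
q_table = {}

def get_q(state, action):
    return q_table.get((state, action), 0.0)

def bucketize(loads):
    # Coarse discretization of continuous VM loads into integer buckets.
    return tuple(min(int(l), 9) for l in loads)

def choose_vm(state):
    # Epsilon-greedy action selection over the VMs.
    if random.random() < EPSILON:
        return random.randrange(NUM_VMS)
    return max(range(NUM_VMS), key=lambda a: get_q(state, a))

def update(state, action, reward, next_state):
    # Standard Q-learning update rule.
    best_next = max(get_q(next_state, a) for a in range(NUM_VMS))
    q_table[(state, action)] = get_q(state, action) + ALPHA * (
        reward + GAMMA * best_next - get_q(state, action))

def simulate(num_tasks=1000):
    loads = [0.0] * NUM_VMS  # outstanding work queued on each VM
    for _ in range(num_tasks):
        task_len = random.uniform(1.0, 5.0)
        state = bucketize(loads)
        vm = choose_vm(state)
        wait = loads[vm]          # the task waits for the VM's queue to drain
        loads[vm] += task_len
        next_state = bucketize(loads)
        update(state, vm, -wait, next_state)  # penalize waiting time
        # Time advances: each VM processes one unit of work per step.
        loads = [max(0.0, l - 1.0) for l in loads]
    return loads

final_loads = simulate()
```

Because the reward penalizes the chosen VM's queue length, the agent gradually learns to spread tasks across VMs, which is the same intuition, at a much smaller scale, behind the learning-based schedulers evaluated in the article.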
