首页> 外文期刊>Engineering Applications of Artificial Intelligence >Reinforcement Learning based scheduling in a workflow management system
【24h】

Reinforcement Learning based scheduling in a workflow management system

机译:基于加强学习的工作流管理系统中的计划

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Any computational process from simple data analytics tasks to training a machine learning model can be described by a workflow. Many workflow management systems (WMS) exist that undertake the task of scheduling workflows across distributed computational resources. In this work, we introduce a WMS that leverages machine learning to predict workflow task runtime and the probability of failure of task assignments to execution sites. The expected runtime of workflow tasks can be used to approximate the weight of the workflow graph branches in respect to the total workflow workload and the ability to anticipate task failures can discourage task assignments that are unlikely to succeed. We demonstrate that the proposed machine learning models can lead to significantly more informed scheduling decisions that minimize task failures and utilize execution sites more efficiently, thus leading to reduced workflow runtime. Additionally, we train a modified sequence-to-sequence neural network architecture via reinforcement learning to perform scheduling decisions as part of a WMS. Our approach introduces a WMS that can drastically improve its scheduling performance by independently learning over time, without external intervention or reliance on any specific heuristic or optimization technique. Finally, we test our approach in real-world scenarios utilizing computationally demanding and data intensive workflows and evaluate its performance against existing scheduling methodologies traditionally used in WMSes. The performance evaluation outcome confirms that the proposed approach significantly outperforms the other scheduling algorithms in a consistent manner and achieves the best execution runtime with the lowest number of failed tasks and communication costs.
机译:可以通过工作流描述来自简单数据分析任务的任何计算过程,以训练机器学习模型。存在许多工作流管理系统(WMS),该系统在分布式计算资源中进行调度工作流程的任务。在这项工作中,我们介绍了一种WM,利用机器学习来预测工作流任务运行时间以及任务分配失败的可能性。工作流任务的预期运行时间可用于近似于工作流程图的权重,而是在总工作流程工作负载方面的权重,预测任务失败的能力可以劝阻不可能成功的任务分配。我们证明,所提出的机器学习模型可以导致更明显更明显的调度决策,最小化任务故障并更有效地利用执行站点,从而导致工作流程运行时间减少。此外,我们通过增强学习培训修改的序列到序列神经网络架构,以执行作为WMS的一部分的调度决策。我们的方法介绍了一种WM,可以通过随着时间的推移,无需外部干预或依赖任何特定启发式或优化技术,从而通过独立学习来彻底改善其调度性能。最后,我们在现实世界场景中使用计算要求苛刻和数据密集型工作流程来测试我们的方法,并评估其对传统上用于WMSES的现有调度方法的性能。性能评估结果证实,所提出的方法以一致的方式显着优于其他调度算法,并实现了最低的任务和通信成本的最低执行运行时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号