首页> 外文期刊>Journal of supercomputing >Scheduling large-scale scientific workflow on virtual machines with different numbers of vCPUs
【24h】

Scheduling large-scale scientific workflow on virtual machines with different numbers of vCPUs

机译:在具有不同数量的VCPU上的虚拟机上调度大规模科学工作流程

获取原文
获取原文并翻译 | 示例
           

摘要

With the wide deployment of cloud computing in scientific computing, cost minimization is increasingly critical for large-scale scientific workflow. Unfortunately, due to the highly intricate directed acyclic graph (DAG)-based workflow and the flexible usage of virtual machines (VMs) in cloud platform, the existing workflow scheduling approaches are inefficient to strike a balance between the parallelism and the topology of the DAG-based workflow while using the VMs, which causes a low utilization of VMs and consumes more cost. To address these issues, this paper presents a novel task scheduling framework named cost minimization approach with the DAG splitting method (COMSE) for minimizing the cost of running a deadline-constrained large-scale scientific workflow. First, we provide comprehensive theoretical analyses on how to improve the utilization of a resource-balanced multi-vCPU VM for running multiple tasks simultaneously. Second, considering the balance between the parallelism and the topology of a workflow, we simplify the DAG-based workflow, and based on the simplified DAG, a DAG splitting method is devised to preprocess the workflow. Third, since the cloud is charged by hours, we also design an exact algorithm to find the optimal operation pattern for a given schedule to make the consumed instance hours minimum, and this algorithm is named as instance hours minimization by Dijkstra (TOID). Finally, by employing the DAG splitting method and the TOID, the COMSE schedules a deadline-constrained large-scale scientific workflow on the multi-vCPU VMs and incorporates two important objects: minimizing the computation cost and the communication cost. Our solution approach is evaluated through rigorous performance evaluation study using real-word workflows, and the results show that the proposed COMSE approach outperforms existing algorithms in terms of computation cost and communication cost.
机译:随着科学计算中的云计算的广泛部署,成本最小化对于大规模的科学工作流程越来越重要。遗憾的是,由于云平台中的高度复杂的无循环图(DAG)的工作流程和虚拟机(VMS)的灵活使用,现有的工作流程调度方法效率低下,以在平行和DAG的拓扑之间取得平衡使用VMS的工作流程,这会导致VM的利用率低,消耗更多成本。为了解决这些问题,本文提出了一种新的任务调度框架,名为成本最小化方法,具有DAG拆分方法(COMSE),以最大限度地减少运行截止日期约束的大规模科学工作流程的成本。首先,我们提供了全面的理论分析,了解如何提高资源平衡多VCPU VM的利用,同时运行多个任务。其次,考虑到并行性与工作流程的拓扑之间的平衡,我们简化了基于DAG的工作流程,并基于简化的DAG,设计了一种DAG分割方法来预处理工作流程。第三,由于云被收取了数小时,我们还设计了一个精确的算法,以找到给定计划的最佳操作模式,以使消耗的例子最小为最小,并且该算法被命名为Dijkstra(TOID)的实例小时最小化。最后,通过采用DAG拆分方法和TOID,COM​​SE计划多VCPU VM上的截止日期约束的大规模科学工作流程,并包含两个重要的对象:最小化计算成本和通信成本。我们的解决方案方法是通过使用实际处理流程的严格性能评估研究来评估的,结果表明,在计算成本和通信成本方面,所提出的COMSE方法优于现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号