首页> 外文期刊>Future generation computer systems >Workflow performance improvement using model-based scheduling over multiple clusters and clouds
【24h】

Workflow performance improvement using model-based scheduling over multiple clusters and clouds

机译:使用基于模型的计划在多个集群和云上提高工作流程性能

获取原文
获取原文并翻译 | 示例
       

摘要

In recent years, a variety of computational sites and resources have emerged, and users often have access to multiple resources that are distributed. These sites are heterogeneous in nature and performance of different tasks in a workflow varies from one site to another. Additionally, users typically have a limited resource allocation at each site capped by administrative policies. In such cases, judicious scheduling strategy is required in order to map tasks in the workflow to resources so that the workload is balanced among sites and the overhead is minimized in data transfer. Most existing systems either run the entire workflow in a single site or use naive approaches to distribute the tasks across sites or leave it to the user to optimize the allocation of tasks to distributed resources. This results in a significant loss in productivity. We propose a multi-site workflow scheduling technique that uses performance models to predict the execution time on resources and dynamic probes to identify the achievable network throughput between sites. We evaluate our approach using real world applications using the Swift parallel and distributed execution framework. We use two distinct computational environments-geographically distributed multiple clusters and multiple clouds. We show that our approach improves the resource utilization and reduces execution time when compared to the default schedule.
机译:近年来,出现了各种计算站点和资源,并且用户经常可以访问分布的多个资源。这些站点本质上是异构的,工作流中不同任务的执行情况在一个站点之间会有所不同。此外,用户通常在每个站点的管理策略限定的资源分配有限。在这种情况下,需要明智的调度策略,以便将工作流中的任务映射到资源,以便在站点之间平衡工作量,并最大程度地减少数据传输中的开销。大多数现有系统要么在单个站点中运行整个工作流,要么使用幼稚的方法在各个站点之间分配任务,或者将其留给用户以优化任务对分布式资源的分配。这导致生产率的重大损失。我们提出了一种多站点工作流调度技术,该技术使用性能模型来预测资源上的执行时间,并使用动态探针来确定站点之间可实现的网络吞吐量。我们使用Swift并行和分布式执行框架,使用实际应用程序评估我们的方法。我们使用两个不同的计算环境-地理分布的多个群集和多个云。我们证明,与默认计划相比,我们的方法可以提高资源利用率并减少执行时间。

著录项

  • 来源
    《Future generation computer systems》 |2016年第1期|206-218|共13页
  • 作者单位

    Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

    Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

    Leadership Computing Facility Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

    Leadership Computing Facility Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

    Leadership Computing Facility Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

    Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, 60439, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    System modeling; Workflow; Optimization; Swift; Clouds;

    机译:系统建模;工作流程;优化;迅速;乌云;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号