...
首页> 外文期刊>Future generation computer systems >A balanced scheduler with data reuse and replication for scientific workflows in cloud computing systems
【24h】

A balanced scheduler with data reuse and replication for scientific workflows in cloud computing systems

机译:具有数据重用和复制功能的平衡调度程序,用于云计算系统中的科学工作流

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Cloud computing provides substantial opportunities to researchers who demand pay-as-you-go computing systems. Although cloud provider (e.g., Amazon Web Services) and application provider (e.g., biologists, physicists, and online gaming companies) both have specific performance requirements (e.g. application response time), it is the cloud scheduler's responsibility to map the application to underlying cloud resources. This article presents a Balanced and file Reuse-Replication Scheduling (BaRRS) algorithm for cloud computing environments to optimally schedule scientific application workflows. BaRRS splits scientific workflows into multiple sub-workflows to balance system utilization via parallelization. It also exploits data reuse and replication techniques to optimize the amount of data that needs to be transferred among tasks at run-time. BaRRS analyzes the key application features (e.g., task execution times, dependency patterns and file sizes) of scientific workflows for adapting existing data reuse and replication techniques to cloud systems. Further, BaRRS performs a trade-off analysis to select the optimal solution based on two optimization constraints: execution time and monetary cost of running scientific workflows. BaRRS is compared with a state-of-the-art scheduling approach; experiments prove its superior performance. Experiments include four well known scientific workflows with different dependency patterns and data file sizes. Results were promising and also highlighted most critical factors affecting execution of scientific applications on clouds.
机译:云计算为需要按需付费计算系统的研究人员提供了大量机会。尽管云提供商(例如,Amazon Web Services)和应用程序提供商(例如,生物学家,物理学家和在线游戏公司)都具有特定的性能要求(例如,应用程序响应时间),但是云调度程序负责将应用程序映射到底层云资源。本文提出了一种针对云计算环境的平衡和文件重用-复制调度(BaRRS)算法,以最佳地调度科学应用工作流程。 BaRRS将科学工作流分为多个子工作流,以通过并行化平衡系统利用率。它还利用数据重用和复制技术来优化运行时需要在任务之间传输的数据量。 BaRRS分析了科学工作流程的关键应用程序功能(例如任务执行时间,依赖关系模式和文件大小),以使现有数据重用和复制技术适应云系统。此外,BaRRS根据两个优化约束执行权衡分析以选择最佳解决方案:执行时间和运行科学工作流的金钱成本。将BaRRS与最新的调度方法进行了比较;实验证明了其优越的性能。实验包括四个众所周知的科学工作流程,它们具有不同的依赖模式和数据文件大小。结果令人鼓舞,并突出显示了影响云上科学应用程序执行的最关键因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号