首页> 外文会议>IEEE International Symposium on Policies for Distributed Systems and Networks >Improving Scientific Workflow Performance Using Policy Based Data Placement
【24h】

Improving Scientific Workflow Performance Using Policy Based Data Placement

机译:使用基于策略的数据展示位置提高科学工作流性能

获取原文

摘要

I/O intensive jobs such as stage-in, stage-out or data clean-up jobs account for significant time in execution of scientific workflows. Workflow managers typically add these data management operations as supporting jobs to computational tasks with scheduling emphasis on compute jobs only. We present the integration of the Pegasus Workflow Management System with a Policy Based Data Placement Service (PDPS) to reduce overall workflow execution time. Pegasus delegates all data staging jobs to PDPS, which schedules and executes stage-in jobs based on selected data placement policies and simply executes stage-out and clean-up jobs independent of the workflow execution state. We measure the impact of using PDPS with Pegasus first with the Montage workflow, and then with a synthetic workflow. We enforce two policies and demonstrate the advantage of using PDPS for asynchronous data placement for scientific workflows. Our results show that the influence of PDPS on the overall workflow runtimes is dependent on the data characteristics of the executable workflow and the data placement policy being enforced.
机译:I / O密集型的工作,如阶段中,阶段出或数据清理工作占了科学的工作流程的执行显著时间。流程经理通常添加这些数据管理操作的配套作业的计算任务与调度强调只计算作业。我们目前的飞马工作流管理系统的集成与基于策略的数据放置服务(PDP)的,以降低整体工作流程的执行时间。飞马代表所有数据分段作业,以PDP中,其时间表和执行阶段的工作基础上选定的数据放置策略和简单的执行阶段,并清理工作无关的工作流程执行状态。我们衡量首先使用PDPS与飞马与蒙太奇的工作流程的影响,然后用合成的工作流程。我们执行两个策略,证明了使用PDP中进行科学的工作流程异步数据放置的优势。我们的研究结果表明,在整个工作流程运行时的PDP的影响取决于执行的工作流程和数据放置策略的数据特征被强制执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号