首页> 外文会议>Cluster Computing and the Grid, 2009. CCGRID '09 >Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids
【24h】

Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids

机译:容错与调度技术相结合的计算网格上的工作流应用

获取原文
获取原文并翻译 | 示例

摘要

Complex scientific workflows are now Increasingly executed on computational grids. In addition to the challenges of managing and scheduling these workflows, reliability challenges arise because of the unreliable nature of large-scale grid infrastructure. Fault tolerance mechanisms like over-provisioning and checkpoint-recovery are used in current grid application management systems to address these reliability challenges. In this work, we propose new approaches that combine these fault tolerance techniques with existing workflow scheduling algorithms. We present a study on the effectiveness of the combined approaches by analyzing their impact on the reliability of workflow execution, workflow performance and resource usage under different reliability models, failure prediction accuracies and workflow application types.
机译:现在,越来越复杂的科学工作流越来越在计算网格上执行。除了管理和调度这些工作流的挑战之外,由于大规模网格基础架构的不可靠特性,也带来了可靠性挑战。当前的网格应用程序管理系统中使用了诸如超置备和检查点恢复之类的容错机制来解决这些可靠性挑战。在这项工作中,我们提出了将这些容错技术与现有工作流程调度算法相结合的新方法。我们通过分析组合方法对工作流执行的可靠性,工作流性能和资源使用情况(在不同的可靠性模型,故障预测准确性和工作流应用程序类型下)的影响,来研究组合方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号