...
首页> 外文期刊>ETRI journal >Combining replication and checkpointing redundancies for reducing resiliency overhead
【24h】

Combining replication and checkpointing redundancies for reducing resiliency overhead

机译:结合复制和检查点冗余来减少弹性开销

获取原文
           

摘要

We herein propose a heuristic redundancy selection algorithm that combines resubmission, replication, and checkpointing redundancies to reduce the resiliency overhead in fault‐tolerant workflow scheduling. The appropriate combination of these redundancies for workflow tasks is obtained in two consecutive phases. First, to compute the replication vector (number of task replicas), we apportion the set of provisioned resources among concurrently executing tasks according to their needs. Subsequently, we obtain the optimal checkpointing interval for each task as a function of the number of replicas and characteristics of tasks and computational environment. We formulate the problem of obtaining the optimal checkpointing interval for replicated tasks in situations where checkpoint files can be exchanged among computational resources. The results of our simulation experiments, on both randomly generated workflow graphs and real‐world applications, demonstrated that both the proposed replication vector computation algorithm and the proposed checkpointing scheme reduced the resiliency overhead.
机译:我们在本文中提出了一种启发式冗余选择算法,该冗余选择算法结合重新提交,复制和检查点冗余,以减少容错工作流程调度中的弹性开销。这些冗余的适当组合在两个连续的阶段获得了工作流任务。首先,要计算复制矢量(任务副本的数量),我们按照它们的需求并发执行任务之间的配置资源集。随后,我们将每个任务获得最佳检查点间隔,作为任务数量和任务和计算环境的特征的函数。我们制定在可以在计算资源之间交换检查点文件的情况下获得复制任务的最佳检查点间隔的问题。我们的仿真实验结果在随机产生的工作流程图和实际应用中,证明了所提出的复制矢量计算算法和所提出的检查点方案均降低了弹性开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号