首页> 外文期刊>Distributed Computing >Automation of fault-tolerant graceful degradation
【24h】

Automation of fault-tolerant graceful degradation

机译:容错适度降级的自动化

获取原文
获取原文并翻译 | 示例

摘要

Traditionally, (nonmasking and masking) fault-tolerance has focused on ensuring that after the occurrence of faults, the program recovers to states from where it continues to satisfy its original specification. However, a problem with this limited notion is that, in some cases, it may be impossible to recover to states from where the entire original specification is satisfied. For this reason, one can consider a fault-tolerant graceful-degradation program that ensures that upon the occurrence of faults, the program recovers to states from where a (given) subset of its specification is satisfied. Typically, the subset of specification satisfied thus would be the critical/important requirements. In this paper, we initially focus on automatically revising a given fault-intolerant program into a fault-tolerant gracefully degrading program. Specifically, we propose a two-step approach: In the first step, we transform the fault-intolerant program into a graceful program. This program is guaranteed to satisfy only the given subset of specification (e.g., critical requirements). In particular, this step involves adding new behaviors that will satisfy the given subset of the specification. The second step involves utilizing the original program and the graceful program to obtain a fault-tolerant gracefully degrading program. We also develop an algorithm to transform the gracefully degrading program into a distributed gracefully degrading program. Afterwards, the second phase of our transformation can be applied to generate a distributed fault-tolerant gracefully degrading program. We showcase the algorithm with three different non-trivial case studies. Finally, we formalize the problem of multi-graceful degradation and propose an algorithm that solves it and we use a complex case study to showcase the viability of the approach. All the algorithms have polynomial time complexity in the size of the state space of the original program.
机译:传统上,(非屏蔽和屏蔽)容错能力集中于确保故障发生后,程序恢复到继续满足其原始规格的状态。但是,这种有限概念的问题在于,在某些情况下,可能无法恢复到满足整个原始规格的状态。出于这个原因,可以考虑一个容错的优雅降级程序,该程序可确保在发生错误时,该程序恢复到满足其(给定)其规范子集的状态。通常,因此满足的规范子集将是关键/重要要求。在本文中,我们最初专注于自动将给定的容错程序修改为容错的性能下降程序。具体来说,我们提出了一种两步方法:在第一步中,我们将容错程序转换为正常程序。保证该程序仅满足给定的规格子集(例如,关键要求)。特别是,此步骤涉及添加满足规范给定子集的新行为。第二步涉及利用原始程序和优雅程序来获得容错的优雅降级程序。我们还开发了一种算法,用于将正常降级程序转换为分布式正常降级程序。然后,可以将转换的第二阶段应用于生成分布式容错的性能下降程序。我们通过三个不同的非平凡案例研究展示了该算法。最后,我们将多宽容降级的问题形式化,并提出一种解决该问题的算法,并使用一个复杂的案例研究来展示该方法的可行性。所有算法在原始程序状态空间的大小上都具有多项式时间复杂度。

著录项

  • 来源
    《Distributed Computing》 |2019年第1期|1-25|共25页
  • 作者单位

    Univ Warwick, Dept Comp Sci, Coventry, W Midlands, England;

    Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号