首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Designing Fault Tolerance Strategy by Iterative Redundancy for Component-Based Distributed Computing Systems
【24h】

Designing Fault Tolerance Strategy by Iterative Redundancy for Component-Based Distributed Computing Systems

机译:基于组件的分布式计算系统的迭代冗余设计容错策略

获取原文
           

摘要

Reliability is a critical issue for component-based distributed computing systems, some distributed software allows the existence of large numbers of potentially faulty components on an open network. Faults are inevitable in this large-scale, complex, distributed components setting, which may include a lot of untrustworthy parts. How to provide highly reliable component-based distributed systems is a challenging problem and a critical research. Generally, redundancy and replication are utilized to realize the goal of fault tolerance. In this paper, we propose a CFI (critical fault iterative) redundancy technique, by which the efficiency can be guaranteed to make use of resources (e.g., computation and storage) and to create fault-tolerance applications. When operating in an environment with unknown components’ reliability, CFI redundancy is more efficient and adaptive than other techniques (e.g., K-Modular Redundancy and N-Version Programming). In the CFI strategy of redundancy, the function invocation relationships and invocation frequencies are employed to rank the functions’ importance and identify the most vulnerable function implemented via functionally equivalent components. A tradeoff has to be made between efficiency and reliability. In this paper, a formal theoretical analysis and an experimental analysis are presented. Compared with the existing methods, the reliability of components-based distributed system can be greatly improved by tolerating a small part of significant components.
机译:对于基于组件的分布式计算系统,可靠性是一个关键问题,某些分布式软件允许在开放网络上存在大量潜在的故障组件。在这种大规模的,复杂的,分布式组件设置中,故障是不可避免的,其中可能包括许多不可靠的部分。如何提供高度可靠的基于组件的分布式系统是一个具有挑战性的问题,也是一项至关重要的研究。通常,利用冗余和复制来实现容错的目标。在本文中,我们提出了一种CFI(关键故障迭代)冗余技术,通过该技术可以保证效率以利用资源(例如计算和存储)并创建容错应用程序。在组件可靠性未知的环境中运行时,CFI冗余比其他技术(例如,K-Modular Redundancy和N-Version Programming)更有效和更具适应性。在CFI冗余策略中,使用功能调用关系和调用频率来对功能的重要性进行排名,并确定通过功能等效组件实现的最易受攻击的功能。必须在效率和可靠性之间进行权衡。本文提出了形式化的理论分析和实验分析。与现有方法相比,通过容忍一小部分重要组件,可以大大提高基于组件的分布式系统的可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号