Unreliable Failure Detectors for Reliable Distributed Systems.

Abstract

It is well known that Consensus, a fundamental problem of fault-tolerant distributed computing, cannot be solved in asynchronous systems with crash failures. This impossibility result stems from the lack of reliable failure detection in such systems. To circumvent such impossibility results, we introduce the concept of unreliable failure detectors that can make mistakes, and study the problem of using them to solve Consensus. We characterize unreliable failure detectors by two types of properties: completeness and accuracy. Informally, completeness requires that the failure detector eventually suspects every process that actually crashes, while accuracy restricts the mistakes that it can make. We define a hierarchy of failure detectors based on the strength of their accuracy. We determine which failure detectors in this hierarchy can be used to solve Consensus despite any number of crashes, and which ones require a majority of correct processes. We show that Consensus can be solved with weak failure detectors, i.e., failure detectors that make an infinite number of mistakes. This leads to the following question: What is the weakest failure detector for solving Consensus? In a companion paper, we show that ◇W, one of the failure detectors that we consider here, is the weakest failure detector for solving Consensus in asynchronous systems. In this paper, we show that Consensus and Atomic Broadcast are reducible to each other in asynchronous systems. Thus, all our results apply to Atomic Broadcast as well.
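To make the abstract's notion of a detector that "can make mistakes" concrete, here is a minimal sketch of a timeout-based failure detector. It is not taken from the report: the class `TimeoutFailureDetector`, its methods, and the logical-clock values are all hypothetical and chosen only for illustration. The detector suspects any process whose heartbeat is overdue (which tends toward completeness), and when a suspected process turns out to be alive it retracts the suspicion and lengthens that process's timeout (which limits, but does not eliminate, mistakes).

```python
class TimeoutFailureDetector:
    """Illustrative only: suspects a process whose heartbeat is overdue."""

    def __init__(self, processes, initial_timeout=2):
        # Per-process timeout, in ticks of a logical clock (an assumption).
        self.timeout = {p: initial_timeout for p in processes}
        self.last_heartbeat = {p: 0 for p in processes}
        self.suspected = set()

    def heartbeat(self, p, now):
        """Record a heartbeat from process p at logical time `now`."""
        self.last_heartbeat[p] = now
        if p in self.suspected:
            # The earlier suspicion was a mistake: retract it and
            # wait longer for this process from now on.
            self.suspected.discard(p)
            self.timeout[p] += 1

    def tick(self, now):
        """Update and return the set of currently suspected processes."""
        for p, seen in self.last_heartbeat.items():
            if now - seen > self.timeout[p]:
                self.suspected.add(p)
        return set(self.suspected)


fd = TimeoutFailureDetector(["p", "q"])
fd.heartbeat("p", 1)
fd.heartbeat("q", 1)
fd.tick(4)             # both heartbeats are overdue, so both are suspected
fd.heartbeat("q", 5)   # q was only slow: the suspicion is retracted
fd.tick(6)             # p stays suspected; q is trusted again
```

In this run `p` plays the role of a crashed process that is suspected permanently, while `q` is a correct process that was suspected by mistake and later trusted again.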

Bibliographic record

Authors
Chandra, T. D.; Toueg, S.
Year: 1993
Pages: 1-50
Total pages: 50
Original format: PDF
Text language: eng
Chinese Library Classification: Industrial technology
Keywords
Fault tolerant computing; Distributed data processing; Reliability(Electronics); Failure(Electronics); Accuracy; Asynchronous systems; Crashes; Detection; Detectors; Errors; Hierarchies;

Similar literature

1. A Safe Election Protocol based on an Unreliable Failure Detector in Distributed Systems [J] . SungHoon Park Indian Journal of Science and Technology . 2015, No. 34
2. Quorum-based mutual exclusion in asynchronous distributed systems with unreliable failure detectors [J] . Sung-Hoon Park, Seon-Hyong Lee Journal of supercomputing . 2014, No. 2
3. Asynchronous Communication under Reliable and Unreliable Network Topologies in Distributed Multiagent Systems: A Robust Technique for Computing Average Consensus [J] . Mustafa Ali, ul Islam Muhammad Najam, Ahmed Salman, Mathematical Problems in Engineering . 2018, No. PTa3
4. A Practical Election Protocol Based on an Unreliable Failure Detector in Distributed Systems [C] . Yong Hwan Cho, Sung-Hoon Park, Seon-Hyong Lee International conference on parallel and distributed processing techniques and applications . 2014
5. Achieving Scalable and Reliable Non-Intrusive Failure Reproduction in Distributed Systems by Enhancing the Event Chaining Approach [D] . Ren, Xiang. 2018
6. Choice between reliable and unreliable reinforcement alternatives revisited: Preference for unreliable reinforcement [O] . Terry W. Belke, Marcia L. Spetch 1994
7. Unreliable Failure Detectors for Reliable Distributed Systems [O] . Tushar Deepak Chandra, Sam Toueg 1996
