首页> 外文学位 >Efficient survivability for highly replicated services.
【24h】

Efficient survivability for highly replicated services.

机译:高复制服务的有效生存能力。

获取原文
获取原文并翻译 | 示例

摘要

Networked services like distributed file systems can suffer a wide range of problems such as machine crashes and security intrusions that may cause downtime, incorrect behavior, and other undesirable issues. Byzantine-fault-tolerant protocols that replicate the services represent catchall solutions for such problems by providing survivability---the ability of a service to operate correctly despite instances of such security or reliability "faults". These protocols use Byzantine quorum systems as building blocks in order to ensure that services operate correctly and are available to clients even in the presence of faults. However, there are a variety of Byzantine quorum systems, which differ in three primary measures: fault tolerance---a measure of the number of faults that can be tolerated; load---a measure of efficiency; and availability---a measure of how well the service remains available for use. Unfortunately, no quorum system excels in all categories.;In this dissertation, we show that, compared with previous quorum systems, probabilistic quorum systems can provide better fault tolerance and load albeit at the cost of admitting a bounded probability of failing to mask faults. We present a probabilistic opaque quorum system that can tolerate up to 37% more faults than can traditional opaque quorum systems. Then, we present a technique called write markers for probabilistic masking and opaque quorum systems that can tolerate 50% and 48% more faults, respectively, compared with traditional quorum systems. Moreover, these probabilistic quorum systems with write markers have asymptotically better load than the bounds proven for previous masking and opaque quorum systems, achieving an optimal O(1/ n ) in certain cases, and presenting the possibility of smaller quorums that yield efficient access for clients.;Additional contributions of this dissertation include the following. First, we present a framework for comparing Byzantine-fault-tolerant protocols on the basis of how they use quorum systems. This framework highlights the similarity between protocols that are explicitly quorum based, such as Q/U, and others like BFT and Zyzzyva that are not. Second, we introduce a way of improving the availability of probabilistic quorum systems by providing some leeway in how quorums are chosen. Instead of assuming that all quorums are chosen uniformly at random, we allow faulty clients to choose quorums by any strategy from access sets that are chosen uniformly at random. Third, we present the first analysis of a probabilistic quorum system that accounts for the behavior of Byzantine-faulty clients. We anticipate that a faulty client may choose quorums with the goal of maximizing the error probability, and show the effects that this may have. Fourth, we present a framework based on the McDiarmid inequality in order to prove that probabilistic quorum systems in general can meet specific load and fault tolerance targets. This framework allows us to prove that probabilistic masking quorum systems can tolerate up to 13% more faults than shown using Chernoff bounds previously. Fifth, we present a protocol by which probabilistic quorum systems can tolerate Byzantine-faulty clients. Such clients are otherwise problematic in that they may seek to cause the system to fail to mask faults. Finally, we analyze the cost of changing quorums routinely, as may be required by probabilistic quorum systems. This analysis is in the context of wide area networks with the Q/U protocol, which can require state to be transferred between servers as a result of quorum changes.
机译:诸如分布式文件系统之类的网络服务会遭受各种各样的问题,例如机器崩溃和安全入侵,这可能会导致停机,错误的行为以及其他不良问题。复制服务的拜占庭容错协议通过提供可生存性来表示针对此类问题的全面解决方案,即服务具有尽管存在此类安全性或可靠性“故障”的情况也能够正常运行的能力。这些协议使用拜占庭仲裁系统作为构建块,以确保服务正确运行并且即使在出现故障时也可供客户端使用。但是,有多种拜占庭仲裁系统,它们在以下三个主要方面有所不同:容错-一种可以容忍的故障数量的度量;负载-效率的量度;和可用性-衡量服务保持可用状态的程度。不幸的是,没有一个仲裁系统在所有类别中都表现出色。本文证明,与先前的仲裁系统相比,概率仲裁系统可以提供更好的容错能力和负载,尽管以承认无法掩盖故障的有限概率为代价。我们提出了一种概率不透明仲裁系统,与传统不透明仲裁系统相比,它可以容忍多达37%的故障。然后,我们提出一种称为写标记的技术,用于概率屏蔽和不透明仲裁系统,与传统仲裁系统相比,它们分别可以容忍多出50%和48%的故障。此外,这些具有写标记的概率定额系统比以前的掩蔽和不透明定额系统已证明的界限具有渐近更好的负载,在某些情况下可实现最佳O(1 / n),并提供了较小的定额,可为客户。本论文的其他贡献包括以下内容。首先,我们提供了一个框架,用于根据拜占庭容错协议使用仲裁系统的方式进行比较。该框架强调了明确基于仲裁的协议(例如Q / U)与其他协议(例如BFT和Zyzzyva)之间的相似性。其次,我们介绍了一种通过选择仲裁方式的方式来提高概率仲裁系统可用性的方法。代替假定所有仲裁均是随机选择的,我们允许有缺陷的客户端通过任何策略从随机选择一致的访问集中的任何策略中选择仲裁。第三,我们对概率仲裁系统的第一个分析进行了分析,该系统解释了拜占庭式故障客户的行为。我们预计有缺陷的客户可能会选择仲裁,以最大程度地提高错误概率,并显示可能产生的影响。第四,我们提出一个基于McDiarmid不等式的框架,以证明概率仲裁系统通常可以满足特定的负载和容错目标。该框架使我们能够证明,概率屏蔽定额系统比以前使用Chernoff边界显示的容错能力高出13%。第五,我们提出了一个协议,概率仲裁系统可以通过该协议来容忍拜占庭式故障客户。否则,此类客户端会出现问题,因为它们可能会导致系统无法掩盖故障。最后,我们分析了概率仲裁系统可能需要的定期更改仲裁的成本。此分析是在具有Q / U协议的广域网的情况下进行的,由于定额更改,可能需要在服务器之间传输状态。

著录项

  • 作者

    Merideth, Michael G.;

  • 作者单位

    Carnegie Mellon University.;

  • 授予单位 Carnegie Mellon University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 89 p.
  • 总页数 89
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号