首页> 外文学位 >xBFT: Byzantine fault tolerance with high performance, low cost, and aggressive fault isolation.
【24h】

xBFT: Byzantine fault tolerance with high performance, low cost, and aggressive fault isolation.

机译:xBFT:具有高性能,低成本和积极的故障隔离功能的拜占庭式容错功能。

获取原文
获取原文并翻译 | 示例

摘要

We are increasingly relying on online services to store, access, share, and disseminate critical information from anywhere and at all times. Such services include email, digital storage, photos, video, health and financial services, etc. With increasing evidence of non-fail-stop failures in practical systems, Byzantine fault tolerant state machine replication technique is becoming increasingly attractive for building highly-reliable services in order to tolerate such failures. However, existing Byzantine fault tolerant techniques fall short of providing high availability, high performance, and long-term data durability guarantees with competitive replication cost.;In this dissertation, we present BFT replication techniques that facilitate the design and implementation of such highly-reliable services by providing high availability, high performance and high durability with competitive replication cost (hardware, software, network, management).;First, we propose CBASE, a BFT state machine replication architecture that leverages application-level parallelism to improve throughput of the replicated system by identifying and executing independent requests concurrently. Traditional state machine replication based Byzantine fault tolerant (BFT) techniques provide high availability and security but fail to provide high throughput. This limitation stems from the fundamental assumption of generalized state machine replication techniques that all replicas execute requests sequentially in the same total order to ensure consistency across replicas. Our architecture thus provides a general way to exploit application parallelism in order to provide high throughput without compromising correctness.;Second, we present Zyzzyva, an efficient BFT agreement protocol that uses speculation to significantly reduce the performance overhead and replication cost of BFT state machine replication. In Zyzzyva, replicas respond to a client's request without first running an expensive three-phase commit protocol to reach agreement on the order in which the request must be processed. Instead, they optimistically adopt the order proposed by the primary and respond immediately to the client. Replicas can thus become temporarily inconsistent with one another, but clients detect inconsistencies, help correct replicas converge on a single total ordering of requests, and only rely on responses that are consistent with this total order. This approach allows Zyzzyva to reduce replication overheads to near their theoretical minima.;Third, we design and implement SafeStore, a distributed storage system designed to maintain long-term data durability despite conventional hardware and software faults, environmental disruptions, and administrative failures caused by human error or malice. The architecture of SafeStore is based on fault isolation, which SafeStore applies aggressively along administrative, physical, and temporal dimensions by spreading data across autonomous storage service providers (SSPs). SafeStore also performs an efficient end-to-end audit of SSPs to detect data loss quickly and improve data durability by reducing MTTR. SafeStore offers durable storage with cost, performance, and availability competitive with traditional storage systems.;We evaluate these techniques by implementing BFT replication libraries and further demonstrate the practicality of these approaches by implementing an NFS based replicated file system(CBASE-FS) and a durable storage system (SafeStore-FS).
机译:我们越来越依赖在线服务来随时随地存储,访问,共享和传播关键信息。这样的服务包括电子邮件,数字存储,照片,视频,健康和金融服务等。随着越来越多的证据表明在实际系统中出现不间断故障,拜占庭容错状态机复制技术对于构建高度可靠的服务越来越有吸引力。为了容忍这种失败。但是,现有的拜占庭式容错技术不能提供具有高可用性的高性能,高性能和长期数据持久性保证,并且具有竞争性的复制成本。在本文中,我们提出了BFT复制技术,该技术有助于这种高度可靠的设计和实现。通过提供高可用性,高性能和高耐用性以及具有竞争力的复制成本(硬件,软件,网络,管理)来提供服务。首先,我们提出CBASE,一种BFT状态机复制体系结构,该体系结构利用应用程序级并行性来提高复制的吞吐量。通过同时识别和执行独立请求来实现系统。基于传统状态机复制的拜占庭容错(BFT)技术可提供高可用性和安全性,但无法提供高吞吐量。此限制源自通用状态机复制技术的基本假设,即所有副本以相同的总顺序顺序执行请求以确保副本之间的一致性。因此,我们的体系结构提供了一种利用应用程序并行性的通用方法,以提供高吞吐量而又不影响正确性。其次,我们介绍了Zyzzyva,一种有效的BFT协议,该协议使用推测功能来显着降低BFT状态机复制的性能开销和复制成本。在Zyzzyva中,副本无需首先运行昂贵的三相提交协议就必须处理请求的顺序达成协议,就可以响应客户端的请求。取而代之的是,他们乐观地采用了主要机构提出的命令,并立即对客户做出回应。这样一来,副本之间可能会暂时变得不一致,但是客户端会检测到不一致之处,从而帮助纠正副本收敛于单个请求的总顺序,并且仅依赖于与此总顺序一致的响应。这种方法使Zyzzyva可以将复制开销减少到理论上的最低水平。第三,我们设计和实现了SafeStore,这是一种分布式存储系统,旨在尽管由于传统的硬件和软件故障,环境破坏和管理故障而导致长期数据持久性人为错误或恶意。 SafeStore的体系结构基于故障隔离,通过在自治存储服务提供商(SSP)之间分散数据,SafeStore在管理,物理和时间维度上积极应用。 SafeStore还对SSP执行有效的端到端审核,以快速检测数据丢失并通过减少MTTR来提高数据持久性。 SafeStore提供了与传统存储系统相比具有成本,性能和可用性的持久存储。;我们通过实现BFT复制库来评估这些技术,并通过实现基于NFS的复制文件系统(CBASE-FS)和存储库来进一步证明这些方法的实用性。持久存储系统(SafeStore-FS)。

著录项

  • 作者

    Kotla, Ramakrishna Rao.;

  • 作者单位

    The University of Texas at Austin.;

  • 授予单位 The University of Texas at Austin.;
  • 学科 Engineering Electronics and Electrical.;Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 208 p.
  • 总页数 208
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号