Published in: Parallel and Distributed Systems, IEEE Transactions on

Fault Tolerance in Distributed Systems Using Fused Data Structures



Abstract

Replication is the prevalent solution for tolerating faults in large data structures hosted on distributed servers. To tolerate $f$ crash faults (dead or unresponsive data structures) among $n$ distinct data structures, replication requires $f+1$ replicas of each data structure, resulting in $nf$ additional backups. We present a solution, referred to as fusion, that uses a combination of erasure codes and selective replication to tolerate $f$ crash faults using just $f$ additional fused backups. We show that our solution achieves $O(n)$ savings in space over replication. Further, we present a solution to tolerate $f$ Byzantine faults (malicious data structures) that requires only $nf+f$ backups, compared to the $2nf$ backups required by replication. We explore the theory of fused backups and provide a library of such backups for all the data structures in the Java Collection Framework. The theoretical and experimental evaluation confirms that fused backups are space-efficient compared to replication, while causing very little overhead for normal operation. To illustrate the practical usefulness of fusion, we use fused backups for reliability in Amazon's highly available key-value store, Dynamo. While the current replication-based solution uses 300 backup structures, we present a solution that requires only 120 backup structures. This results in savings in space as well as in other resources such as power.
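
The backup counts quoted in the abstract can be tabulated with a short sketch. The function names and the sample values of `n` and `f` below are illustrative assumptions, not part of the paper or its library; the formulas are the ones stated above ($nf$ versus $f$ for crash faults, $2nf$ versus $nf+f$ for Byzantine faults). The sketch counts backup structures only and says nothing about the size of each backup.

```python
def replication_backups(n: int, f: int, byzantine: bool = False) -> int:
    """Backup count for replication, per the abstract.

    Crash faults: f extra copies of each of the n structures (n*f).
    Byzantine faults: 2*n*f backups.
    """
    return 2 * n * f if byzantine else n * f


def fusion_backups(n: int, f: int, byzantine: bool = False) -> int:
    """Backup count for fusion, per the abstract.

    Crash faults: f fused backups in total.
    Byzantine faults: n*f + f backups.
    """
    return n * f + f if byzantine else f


if __name__ == "__main__":
    # Hypothetical example values, chosen only for illustration.
    n, f = 10, 2
    for byzantine in (False, True):
        kind = "Byzantine" if byzantine else "crash"
        rep = replication_backups(n, f, byzantine)
        fus = fusion_backups(n, f, byzantine)
        print(f"{kind}: replication={rep}, fusion={fus}")
```

For crash faults the ratio of the two counts is $nf / f = n$, which matches the factor behind the $O(n)$ claim in the abstract. The Dynamo figures (300 and 120 backup structures) are taken from the abstract as reported and are not derived by this sketch, since the abstract does not state the configuration behind them.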
