...
首页> 外文期刊>Concurrency and Computation >The Neutralizer: a self-configurable failure detector for minimizing distributed storage maintenance cost
【24h】

The Neutralizer: a self-configurable failure detector for minimizing distributed storage maintenance cost

机译:中和器:一种可自行配置的故障检测器,可将分布式存储的维护成本降至最低

获取原文
获取原文并翻译 | 示例
           

摘要

To achieve high data availability or reliability in an efficient manner, distributed storage systems must detect whether an observed node failure is permanent or transient, and if necessary, generate replicas to restore the desired level of replication. Given the unpredictability of network dynamics, however, distinguishing permanent and transient failures is extremely difficult. Though timeout-based detectors can be used to avoid mistaking transient failures as permanent failures, it is unknown how the timeout values should be selected to achieve a better tradeoff between detection latency and accuracy. In this paper, we address this fundamental tradeoff from several perspectives. First, we explore the impact of different timeout values on maintenance cost by examining the probability of their false positives and false negatives. Second, we propose a self-configurable failure detector called the Neutralizer based on the idea of counteracting false positives with false negatives. The Neutralizer could enable the system to maintain a desired replication level on average with the least amount of bandwidth. We conduct extensive simulations using real trace data from a widely deployed peer-to-peer system and synthetic traces based on PlanetLab and Microsoft PCs, showing a significant reduction in aggregate bandwidth usage after applying the Neutralizer (especially in an environment with a low average node availability). Overall, we demonstrate that the Neutralizer closely approximates the performance of a perfect 'oracle' detector in many cases.
机译:为了以有效的方式实现高数据可用性或可靠性,分布式存储系统必须检测观察到的节点故障是永久的还是暂时的,并且在必要时生成副本以恢复所需的复制级别。但是,鉴于网络动力学的不可预测性,区分永久性故障和暂时性故障非常困难。尽管可以使用基于超时的检测器来避免将瞬态故障误认为是永久性故障,但如何选择超时值以在检测等待时间和准确性之间取得更好的平衡尚不明确。在本文中,我们从几个角度解决了这一基本权衡问题。首先,我们通过检查错误超时和错误否定的可能性来探讨不同超时值对维护成本的影响。其次,我们提出了一种可自我配置的故障检测器,称为“中和器”,它基于用假阴性抵消假阳性的想法。中和器可使系统以最小的带宽平均维持所需的复制级别。我们使用来自广泛部署的对等系统的真实跟踪数据以及基于PlanetLab和Microsoft PC的综合跟踪进行广泛的仿真,结果表明在应用Neutralizer之后,尤其是在平均节点数较低的环境中,聚合带宽的使用显着减少了可用性)。总体而言,我们证明了在许多情况下,中和器都可以完美地逼近完美的“ oracle”探测器的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号