IEEE Transactions on Parallel and Distributed Systems

Design and Evaluation of a Risk-Aware Failure Identification Scheme for Improved RAS in Erasure-Coded Data Centers


Abstract

The reliability, availability, and serviceability (RAS) of erasure-coded data centers are strongly affected by the data repair that node failures trigger. In a traditional failure identification scheme, all chunks share the same identification time threshold, which forgoes opportunities to improve RAS further. To solve this problem, we propose RAFI, a novel risk-aware failure identification scheme. In RAFI, chunk failures are identified using different time thresholds depending on how many chunks in the stripe have failed. Chunks in a high-risk stripe get a shorter identification time, which improves overall data reliability and availability. Chunks in a low-risk stripe get a longer identification time, which reduces repair network traffic. As a result, reliability, availability, and serviceability can all be improved at once. We also propose three optimization techniques that reduce the additional overhead RAFI imposes on management nodes and ensure that RAFI works properly in large-scale clusters. We evaluate RAFI from multiple aspects using simulation, emulation, and a prototype implementation. Simulation and prototype results demonstrate the effectiveness and correctness of RAFI, and emulator runs demonstrate the performance improvement the optimization techniques bring to RAFI.
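
The core idea of risk-dependent identification thresholds can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the data layout, and the timeout values are all hypothetical, and it assumes that a stripe with more unresponsive chunks counts as higher risk. The abstract gives no concrete numbers.

```python
from typing import Dict, List, Sequence

# Assumed, illustrative timeouts in seconds, indexed by (unresponsive chunks - 1).
# These values are placeholders; the abstract does not specify any thresholds.
ASSUMED_THRESHOLDS_S = (1800.0, 300.0, 60.0)


def identification_threshold(
    unresponsive_chunks: int,
    thresholds: Sequence[float] = ASSUMED_THRESHOLDS_S,
) -> float:
    """Return how long a chunk may stay unresponsive before it is declared failed."""
    if unresponsive_chunks < 1:
        raise ValueError("need at least one unresponsive chunk")
    # Stripes with more unresponsive chunks than table entries use the last
    # (shortest) timeout.
    index = min(unresponsive_chunks, len(thresholds)) - 1
    return thresholds[index]


def chunks_to_repair(
    unresponsive_since: Dict[str, float],
    now: float,
    thresholds: Sequence[float] = ASSUMED_THRESHOLDS_S,
) -> List[str]:
    """List the chunks of one stripe that should now be treated as failed.

    unresponsive_since maps a (hypothetical) chunk id to the time, in seconds,
    at which that chunk stopped responding.
    """
    if not unresponsive_since:
        return []
    limit = identification_threshold(len(unresponsive_since), thresholds)
    return sorted(
        chunk_id
        for chunk_id, since in unresponsive_since.items()
        if now - since >= limit
    )
```

With these placeholder values, a stripe with a single unresponsive chunk waits 30 minutes before repair starts, while a stripe with two unresponsive chunks applies the 5-minute limit to both of them. That mirrors the trade-off in the abstract: less repair traffic for low-risk stripes, faster repair for high-risk ones.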
