Published in: IEEE Transactions on Parallel and Distributed Systems

On the Speedup of Recovery in Large-Scale Erasure-Coded Storage Systems



Abstract

Modern storage systems stripe redundant data across multiple nodes to provide availability guarantees against node failures. One form of data redundancy is based on XOR-based erasure codes, which use only XOR operations for encoding and decoding. In addition to tolerating failures, a storage system must also provide fast failure recovery to reduce the window of vulnerability. This work addresses the problem of speeding up the recovery of a single-node failure for general XOR-based erasure codes. We propose a replace recovery algorithm, which uses a hill-climbing technique to search for a fast recovery solution, such that the solution search can be completed within a short time period. We further extend the algorithm to adapt to the scenario where nodes have heterogeneous capabilities (e.g., processing power and transmission bandwidth). We implement our replace recovery algorithm atop a parallelized architecture to demonstrate its feasibility. We conduct experiments on a networked storage system testbed, and show that our replace recovery algorithm uses less recovery time than the conventional recovery approach.
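To make the phrase "use only XOR operations for encoding and decoding" concrete, here is a minimal sketch that assumes the simplest possible XOR-based code: a single parity chunk computed over the data chunks (RAID-5 style). It is an illustration only, not the codes or the replace recovery algorithm evaluated in the paper; the function names `xor_chunks`, `encode`, and `recover` are hypothetical.

```python
from functools import reduce

def xor_chunks(chunks):
    """Return the byte-wise XOR of a list of equal-length byte strings."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*chunks))

def encode(data_chunks):
    """Assumed toy code: one parity chunk that is the XOR of all data chunks."""
    return xor_chunks(data_chunks)

def recover(surviving_chunks):
    """Rebuild the single lost chunk by XOR-ing every surviving chunk,
    i.e. the remaining data chunks plus the parity chunk."""
    return xor_chunks(surviving_chunks)

# Three data chunks, one per node, plus a parity chunk on a fourth node.
data = [b"node", b"fail", b"safe"]
parity = encode(data)

# Simulate the failure of the node holding data[1].
lost_index = 1
survivors = [c for i, c in enumerate(data) if i != lost_index] + [parity]
recovered = recover(survivors)
```

In this toy code there is only one way to rebuild a lost chunk. General XOR-based codes with several parity chunks typically allow more than one set of surviving chunks to rebuild the same lost data, which is what makes a search for a fast recovery solution meaningful.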
