...
首页> 外文期刊>ACM Transactions on Storage >A New Intra-disk Redundancy Scheme for High-Reliability RAID Storage Systems in the Presence of Unrecoverable Errors
【24h】

A New Intra-disk Redundancy Scheme for High-Reliability RAID Storage Systems in the Presence of Unrecoverable Errors

机译:存在不可恢复错误时用于高可靠性RAID存储系统的新磁盘内冗余方案

获取原文
获取原文并翻译 | 示例
           

摘要

Today's data storage systems are increasingly adopting low-cost disk drives that have higher capacity but lower reliability, leading to more frequent rebuilds and to a higher risk of unrecoverable media errors. We propose an efficient intradisk redundancy scheme to enhance the reliability of RAID systems. This scheme introduces an additional level of redundancy inside each disk, on top of the RAID redundancy across multiple disks. The RAID parity provides protection against disk failures, whereas the proposed scheme aims to protect against media-related unrecoverable errors. In particular, we consider an intradisk redundancy architecture that is based on an interleaved parity-check coding scheme, which incurs only negligible I/O performance degradation. A comparison between this coding scheme and schemes based on traditional Reed-Solomon codes and single-parity-check codes is conducted by analytical means. A new model is developed to capture the effect of correlated unrecoverable sector errors. The probability of an unrecoverable failure associated with these schemes is derived for the new correlated model, as well as for the simpler independent error model. We also derive closed-form expressions for the mean time to data loss of RAID-5 and RAID-6 systems in the presence of unrecoverable errors and disk failures. We then combine these results to characterize the reliability of RAID systems that incorporate the intradisk redundancy scheme. Our results show that in the practical case of correlated errors, the interleaved parity-check scheme provides the same reliability as the optimum, albeit more complex, Reed-Solomon coding scheme. Finally, the I/O and throughput performances are evaluated by means of analysis and event-driven simulation.
机译:当今的数据存储系统越来越多地采用低成本磁盘驱动器,这些驱动器具有更高的容量,但可靠性却更低,从​​而导致更频繁的重建以及更高的不可恢复媒体错误风险。我们提出了一种有效的磁盘内冗余方案,以增强RAID系统的可靠性。除了跨多个磁盘的RAID冗余之外,此方案还在每个磁盘内部引入了附加级别的冗余。 RAID奇偶校验可防止磁盘故障,而所提出的方案旨在防止与介质相关的不可恢复的错误。特别是,我们考虑了基于交错奇偶校验编码方案的磁盘内冗余架构,该架构只会导致可忽略的I / O性能下降。通过分析方法,将该编码方案与基于传统里德-所罗门码和单奇偶校验码的方案进行比较。开发了一种新模型来捕获相关的不可恢复的扇区错误的影响。对于新的相关模型以及更简单的独立错误模型,都得出了与这些方案相关的不可恢复故障的可能性。在存在不可恢复的错误和磁盘故障的情况下,我们还导出了RAID-5和RAID-6系统数据丢失平均时间的闭式表达式。然后,我们将这些结果结合起来,以表征结合了磁盘内冗余方案的RAID系统的可靠性。我们的结果表明,在实际的相关错误情况下,交错的奇偶校验方案提供了与最佳方案相同的可靠性,尽管更为复杂,但采用了Reed-Solomon编码方案。最后,通过分析和事件驱动的仿真来评估I / O和吞吐量性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号