首页> 外文会议>IEEE Pacific Rim International Symposium on Dependable Computing >Combining Low IO-Operations During Data Recovery with Low Parity Overhead in Two-Failure Tolerant Archival Storage Systems
【24h】

Combining Low IO-Operations During Data Recovery with Low Parity Overhead in Two-Failure Tolerant Archival Storage Systems

机译:在数据恢复期间结合低IO-Operation,在两次故障普及存档存储系统中具有低奇偶频校验开销

获取原文

摘要

Archival data storage systems contain data that must be preserved over long periods of time but which are often unlikely to be accessed during their lifetime. The best strategy for such systems is to keep their disks powered-off unless they have to be powered up to access their contents, to reconstruct lost data, or to perform other disk maintenance tasks. Of all such tasks, reconstructing data after a disk failure is the one that is likely to have the highest energy footprint and the most impact on the overall power consumption of the array, because it typically involves powering up all the disks belonging to the same reliability stripe as the failed disk and keeping them running for considerable time at each occurrence. We investigate two two-failure tolerant disk layouts that have lower parity overhead than the number of disks read (and hence powered-on) for recovering data on lost drives would suggest. Our first organization is a flat XOR code that organizes the data disks into a rectangle with fewer rows than columns, and adds a simple parity disk to each row and column. Recovery from a disk failure proceeds by prefering columns when reconstructing lost data, and thereby has fewer reads than the parity overhead would normally suggest. Our second layout is based on the most basic pyramid code. We can view this layout as an example RAID Level 6 variant. In this variant, a stripe has a Q-parity calculated from the data disks in the stripe, but the data disks are also organized into smaller groups where each group has a separate P-parity calculated as the exclusive-or of the data disks in the group. We compare the two layouts by measuring their robustness to data loss, their one-year survival rate, and the expected number of number of disks that must be involved to recover from both single and multiple disk failures. Our results show that rectangular layouts are significantly more reliable than layouts based on the most basic Pyramid codes, but that they also require more disk accesses to recover from disk failures.
机译:归档数据存储系统包含必须长时间保留的数据,但在其使用寿命期间通常不太可能被访问。此类系统的最佳策略是保持其磁盘断电,除非它们必须被通电以访问其内容,以重建丢失的数据,或执行其他磁盘维护任务。在所有这样的任务中,在磁盘故障后重建数据是可能具有最高能量足迹的数据以及对阵列的整体功耗的影响最大,因为它通常涉及将所有属于相同可靠性的所有磁盘供电条纹作为失败的磁盘,并使它们在每次发生时运行相当长的时间。我们调查了两个失败的宽容磁盘布局,其具有比读取的磁盘数量较低的奇偶校验磁盘布局(并且为此通电)来恢复丢失的驱动器上的数据建议。我们的第一个组织是一个平面的XOR代码,将数据磁盘组织成矩形,矩形较少的矩形,并将简单的奇偶校验磁盘添加到每个行和列。通过在重建丢失数据时首选列来恢复从磁盘故障进行,从而具有比通常建议的奇偶校验开销更少的读取。我们的第二个布局基于最基本的金字塔代码。我们可以将此布局视为示例RAID级别6变量。在该变体中,条带具有从条带中的数据磁盘计算的Q-奇偶校验,但是数据磁盘也被组织成较小的组,其中每个组具有作为数据磁盘计算的单独的p奇偶校验。群组。我们通过测量其对数据丢失的鲁棒性,一年生存率以及必须涉及的磁盘数量的预期磁盘数量来进行比较两种布局。我们的结果表明,基于最基本的金字塔代码,矩形布局明显比布局更可靠,但它们还需要更多磁盘访问从磁盘故障中恢复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号