首页> 外文会议>USENIX Conference on File and Storage Technologies >An Analysis of Data Corruption in the Storage Stack
【24h】

An Analysis of Data Corruption in the Storage Stack

机译:存储堆栈中数据损坏分析

获取原文

摘要

An important threat to reliable storage of data is silent data corruption. In order to develop suitable protection mechanisms against data corruption, it is essential to understand its characteristics. In this paper, we present the first large-scale study of data corruption. We analyze corruption instances recorded in production storage systems containing a total of 1.53 million disk drives, over a period of 41 months. We study three classes of corruption: checksum mismatches, identity discrepancies, and parity inconsistencies. We focus on checksum mismatches since they occur the most. We find more than 400,000 instances of checksum mismatches over the 41-month period. We find many interesting trends among these instances including: (ⅰ) nearline disks (and their adapters) develop checksum mismatches an order of magnitude more often than enterprise class disk drives, (ⅱ) checksum mismatches within the same disk are not independent events and they show high spatial and temporal locality, and (ⅲ) checksum mismatches across different disks in the same storage system are not independent. We use our observations to derive lessons for corruption-proof system design.
机译:对可靠存储数据的重要威胁是静默数据损坏。为了制定适当的防止数据损坏机制,必须了解其特征。在本文中,我们提出了第一个对数据损坏的大规模研究。我们分析了在生产存储系统中记录的腐败实例,总共有153万磁盘驱动器,在41个月内。我们研究了三类腐败:校验和不匹配,身份差异和平等不一致。我们专注于校验和不匹配,因为它们发生了最多。在41个月期间,我们发现400,000多个校验和不匹配。我们在这些情况下发现了许多有趣的趋势,包括:(Ⅰ)接近磁盘(及其适配器)开发校验和比企业类磁盘驱动器更频繁的数量级,(Ⅱ)在同一磁盘内的校验和不匹配不是独立的事件显示高空间和时间位置,(Ⅲ)相同存储系统中不同磁盘的校验和不匹配不是独立的。我们使用我们的观察来导出腐败的系统设计的课程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号