首页> 外文期刊>Journal of Parallel and Distributed Computing >Achieving high data reliability at low scrubbing cost via failure-aware scrubbing
【24h】

Achieving high data reliability at low scrubbing cost via failure-aware scrubbing

机译:通过故障感知擦洗在低擦洗成本下实现高数据可靠性

获取原文
获取原文并翻译 | 示例
           

摘要

Latent Sector Errors (LSEs) happen at a significant frequency in the field and can impose a huge risk to data reliability. Disk scrubbing is a background process that reads disks periodically to detect LSEs timely, thus shortening the window of vulnerability to data loss. Nowadays, proactive error prediction, using machine learning techniques, has been proposed to improve storage system reliability by increasing the scrubbing rate for disks with higher error rates. Unfortunately, the majority of works incur non-trivial scrubbing costs and overlook the relationship between complete disk failures and LSEs. In this paper, we attempt to maintain or improve data reliability at reduced scrubbing costs. In particular, we design a novel adaptive approach that enforces a lower scrubbing rate for healthy disks and a higher scrubbing rate for disks which are subject to LSEs. Besides LSEs that are specific to partial disk failures, we also adjust scrubbing rates according to complete disk failure rates, because disks typically develop LSEs before they finally fail. Moreover, a voting-based method that exploits the periodic characteristic of scrubbing is proposed to ensure prediction accuracy. Experimental results on a real-world field dataset have demonstrated the effectiveness of our proposed approach. Specifically, the results show that we can achieve the same level of reliability, in terms of Mean-Time-To-Detection (MTTD), as the traditional fixed-rate scrubbing scheme with almost 49% less scrubbing costs or we can improve the reliability by a factor of 2.4X without extra scrubbing costs. Compared with the state-of-the-art approaches, our method can achieve the same level of reliability with nearly 32% less scrubbing costs.
机译:潜在扇区错误(LSE)在现场的一个显着频率下发生,可以对数据可靠性强烈造成巨大风险。磁盘擦洗是一个后台进程,其周期性地读取磁盘以及时检测LSE,从而缩短了数据丢失的漏洞窗口。如今,使用机器学习技术,已经提出了主动误差预测,通过提高具有更高误差率的磁盘的擦洗速率来提高存储系统可靠性。不幸的是,大多数工程都会产生非琐碎的擦洗成本并忽略了完整磁盘故障和LSE之间的关系。在本文中,我们试图在降低擦洗成本下维护或提高数据可靠性。特别地,我们设计一种新颖的自适应方法,该自适应方法能够为健康磁盘的较低擦洗速率和用于受到LSE的磁盘的更高擦洗速率。除了特定于部分磁盘故障的LSE之外,我们还根据完整的磁盘故障率调整擦除速率,因为磁盘通常在它们最终失败之前开发LSE。此外,提出了一种利用擦洗的周期性特性的基于投票的方法,以确保预测准确性。实验结果对现实世界的实地数据集表明了我们提出的方法的有效性。具体而言,结果表明,就平均定时(MTTD)而言,我们可以实现相同的可靠性,因为传统的固定速率擦洗方案,擦洗成本小49%,或者我们可以提高可靠性无需额外的擦洗成本,倍数为2.4倍。与最先进的方法相比,我们的方法可以实现相同的可靠性水平,擦洗成本较低的近32%。

著录项

  • 来源
    《Journal of Parallel and Distributed Computing》 |2020年第10期|220-229|共10页
  • 作者单位

    Wuhan National Laboratory for Optoelectronics Key Laboratory of Information Storage System (School of Computer Science and Technology Huazhong University of Science and Technology) Ministry of Education of China China;

    Wuhan National Laboratory for Optoelectronics Key Laboratory of Information Storage System (School of Computer Science and Technology Huazhong University of Science and Technology) Ministry of Education of China China;

    Wuhan National Laboratory for Optoelectronics Key Laboratory of Information Storage System (School of Computer Science and Technology Huazhong University of Science and Technology) Ministry of Education of China China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Hard disk; Reliability; Latent sector error; Scrubbing; Machine learning;

    机译:硬盘;可靠性;潜在扇区错误;擦洗;机器学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号