首页> 外文期刊>Computer communication review >A 'Hitchhiker's' Guide to Fast and Efficient Data Reconstruction in Erasure-coded Data Centers
【24h】

A 'Hitchhiker's' Guide to Fast and Efficient Data Reconstruction in Erasure-coded Data Centers

机译:擦除编码数据中心中的快速高效数据重构的“旅行者指南”

获取原文
获取原文并翻译 | 示例
           

摘要

Erasure codes such as Reed-Solomon (RS) codes are being extensively deployed in data centers since they offer significantly higher reliability than data replication methods at much lower storage overheads. These codes however mandate much higher resources with respect to network bandwidth and disk 10 during reconstruction of data that is missing or otherwise unavailable. Existing solutions to this problem either demand additional storage space or severely limit the choice of the system parameters. In this paper, we present Hitchhiker, a new erasure-coded storage system that reduces both network traffic and disk IO by around 25% to 45% during reconstruction of missing or otherwise unavailable data, with no additional storage, the same fault tolerance, and arbitrary flexibility in the choice of parameters, as compared to RS-based systems. Hitchhiker "rides" on top of RS codes, and is based on novel encoding and decoding techniques that will be presented in this paper. We have implemented Hitchhiker in the Hadoop Distributed File System (HDFS). When evaluating various metrics on the data-warehouse cluster in production at Facebook with real-time traffic and workloads, during reconstruction, we observe a 36% reduction in the computation time and a 32% reduction in the data read time, in addition to the 35% reduction in network traffic and disk IO. Hitchhiker can thus reduce the latency of degraded reads and perform faster recovery from failed or decommissioned machines.
机译:诸如Reed-Solomon(RS)码之类的擦除码已在数据中心广泛部署,因为与数据复制方法相比,它们提供的可靠性高得多,而存储开销却低得多。然而,在丢失或以其他方式不可用的数据的重建期间,这些代码相对于网络带宽和磁盘10要求更高的资源。针对该问题的现有解决方案或者需要额外的存储空间,或者严重限制了系统参数的选择。在本文中,我们介绍了Hitchhiker,这是一种新的擦除编码存储系统,在丢失或以其他方式无法使用的数据的重建过程中,它可以将网络流量和磁盘IO降低约25%至45%,无需额外的存储,相同的容错能力以及与基于RS的系统相比,在参数选择方面具有任意灵活性。 Hitchhiker在RS代码之上“骑乘”,并且基于将在本文中介绍的新颖编码和解码技术。我们已经在Hadoop分布式文件系统(HDFS)中实现了Hitchhiker。当在Facebook上使用实时流量和工作负载评估生产中的数据仓库集群的各种指标时,在重建期间,除了可以节省36%的计算时间和32%的数据读取时间之外,网络流量和磁盘IO减少了35%。因此,旅行者可以减少读取质量下降的延迟,并从故障或退役的计算机中更快地恢复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号