首页> 外文会议>International conference on very large data bases >Fatman: Cost-saving and reliable archival storage based on volunteer resources
【24h】

Fatman: Cost-saving and reliable archival storage based on volunteer resources

机译:Fatman:基于志愿者资源的节省成本且可靠的档案存储

获取原文

摘要

We present Fatman, an enterprise-scale archival storage based on volunteer contribution resources from underutilized web servers, usually deployed on thousands of nodes with spare storage capacity. Fatman is specifically designed for enhancing the utilization of existing storage resources and cutting down the hardware purchase cost. Two major concerned issues of the system design are maximizing the resource u-tilization of volunteer nodes without violating Service Level Objectives (SLOs) and minimizing the cost without reducing the availability of archival system. Fatman has been widely deployed on tens of thousands of server nodes across several datacenters, provided more than 100PB storage capacity and served dozens of internal mass-data applications. The system realizes an efficient storage quota consolidation by strong isolation and budget limitation, to maximally support resources contribution without any degradation on host-level SLOs. It firstly improves data reliability by applying disk failure prediction to minish failure recovery cost, named fault-aware data management, dramatically reduces the MTTR by 76.3% and decreases file crash ratio by 35% on real-life product workload.
机译:我们介绍Fatman,这是一种企业级档案存储,基于未充分利用的Web服务器的自愿捐款资源,通常部署在具有备用存储容量的数千个节点上。 Fatman专为提高现有存储资源的利用率并降低硬件购买成本而设计。系统设计的两个主要相关问题是在不违反服务水平目标(SLO)的情况下最大程度地利用志愿者节点的资源,并在不降低归档系统可用性的情况下将成本最小化。 Fatman已在多个数据中心的数万个服务器节点上广泛部署,提供了超过100PB的存储容量,并为数十种内部海量数据应用程序提供服务。该系统通过强大的隔离和预算限制实现了有效的存储配额合并,以最大程度地支持资源贡献,而不会降低主机级SLO。它首先通过应用磁盘故障预测来减少故障恢复成本来提高数据可靠性(称为故障感知数据管理),从而在实际产品工作负载上将MTTR显着降低了76.3%,并将文件崩溃率降低了35%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号