2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

DEBAR: A scalable high-performance de-duplication storage system for backup and archiving

Abstract

Driven by the increasing demand for large-scale, high-performance data protection, disk-based de-duplication storage has become a research focus for both the storage industry and the research community, and several new schemes have emerged recently. So far, these systems mainly take inline de-duplication approaches, which are centralized and not easily extended to handle global de-duplication in a distributed environment. We present DEBAR, a de-duplication storage system designed to improve capacity, performance and scalability for de-duplication backup/archiving. DEBAR performs post-processing de-duplication in two phases: in phase I, backup streams are de-duplicated by an in-memory preliminary filter and cached on server disks; in phase II, they are completely de-duplicated in batch. By decentralizing fingerprint lookup and update, DEBAR lets a cluster of servers perform de-duplication backup in parallel, and is shown to scale linearly in both write throughput and physical capacity, achieving an aggregate throughput of 1.7 GB/s and supporting a physical capacity of 2 PB with 16 backup servers.
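
The two-phase flow described in the abstract can be illustrated with a toy sketch. It is not DEBAR's implementation: the abstract gives no algorithmic detail, so the SHA-1 fingerprints, the prefix-based `owner` routing rule, the `BackupServer` class, and the bounded in-memory set standing in for the preliminary filter are all assumptions made for illustration. Phase I drops chunks the filter has already seen and stages the rest; phase II resolves the staged fingerprints in batch against an index partitioned across the servers.

```python
import hashlib
from collections import defaultdict


def fingerprint(chunk: bytes) -> str:
    """Content fingerprint of a chunk (SHA-1 is an assumption, not from the abstract)."""
    return hashlib.sha1(chunk).hexdigest()


def owner(fp: str, num_servers: int) -> int:
    """Hypothetical routing rule: pick the index partition from the fingerprint prefix."""
    return int(fp[:8], 16) % num_servers


class BackupServer:
    """Toy stand-in for one backup server (hypothetical structure)."""

    def __init__(self, filter_capacity: int = 1024):
        self.filter_capacity = filter_capacity
        self.prelim_filter = set()  # phase I: bounded in-memory fingerprint filter
        self.disk_cache = []        # phase I: (fingerprint, chunk) pairs staged "on disk"
        self.index_part = {}        # this server's slice of the global fingerprint index

    def phase1_ingest(self, chunks):
        """Drop chunks the in-memory filter has already seen; stage everything else."""
        for chunk in chunks:
            fp = fingerprint(chunk)
            if fp in self.prelim_filter:
                continue  # duplicate caught by the preliminary filter
            if len(self.prelim_filter) < self.filter_capacity:
                self.prelim_filter.add(fp)
            # When the filter is full, duplicates can slip through to phase II.
            self.disk_cache.append((fp, chunk))


def phase2_batch(servers):
    """Resolve all staged fingerprints in batch; return the number of new chunks stored."""
    batches = defaultdict(list)
    for server in servers:
        for fp, chunk in server.disk_cache:
            batches[owner(fp, len(servers))].append((fp, chunk))
        server.disk_cache = []
        server.prelim_filter.clear()

    stored = 0
    for server_id, batch in batches.items():
        index = servers[server_id].index_part
        # Sorting groups equal fingerprints so each partition is scanned in order.
        for fp, chunk in sorted(batch, key=lambda pair: pair[0]):
            if fp not in index:
                index[fp] = len(chunk)  # stand-in for a real chunk location
                stored += 1
    return stored
```

In this sketch each fingerprint has exactly one owning partition, so lookups and updates for different partitions do not contend with one another. That is one plausible reading of "decentralizing fingerprint lookup and update", not a claim about how DEBAR actually routes fingerprints.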
