Conference: International Conference on Computational Science and Its Applications

Content-Based Chunk Placement Scheme for Decentralized Deduplication on Distributed File Systems

Abstract

The rapid growth of data volume causes problems such as limited storage capacity and rising data management costs. Distributed file systems (DFSs) are widely used to store and manage massive data, and data deduplication schemes are being studied extensively as a way to reduce storage volume. Deduplication increases the available storage capacity by eliminating duplicate data, but the deduplication process introduces performance overhead such as additional disk I/O. In this paper, we propose a content-based chunk placement scheme that increases the deduplication rate on a DFS. To avoid the performance overhead caused by the deduplication process, we use lessfs in each chunk server, so the system performs deduplication in a decentralized manner on each chunk server. Moreover, we use consistent hashing for chunk allocation and failure recovery. Our experimental results show that the proposed system reduces storage space by 60% compared with a system without consistent hashing.
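
The idea of content-based placement with consistent hashing can be illustrated with a minimal sketch. It is not the authors' implementation: the fixed-size chunking, the use of SHA-1, the virtual-node count, the server names, and the helper names (`ConsistentHashRing`, `split_chunks`, `place`) are all assumptions made for illustration, and a Python `set` stands in for the per-server deduplication that the paper delegates to lessfs.

```python
import bisect
import hashlib


def _hash(data: bytes) -> int:
    """Map bytes to a position on the ring (SHA-1 is an assumption)."""
    return int.from_bytes(hashlib.sha1(data).digest(), "big")


class ConsistentHashRing:
    """Hypothetical ring that assigns chunks to chunk servers by content hash."""

    def __init__(self, servers, vnodes=64):
        self._points = []  # sorted ring positions
        self._owner = {}   # ring position -> server name
        for server in servers:
            self.add_server(server, vnodes)

    def add_server(self, server, vnodes=64):
        # Each server owns several virtual nodes to even out the load.
        for i in range(vnodes):
            point = _hash(f"{server}#{i}".encode())
            self._owner[point] = server
            bisect.insort(self._points, point)

    def remove_server(self, server):
        # Dropping a server only affects chunks that mapped to its points.
        self._points = [p for p in self._points if self._owner[p] != server]
        self._owner = {p: s for p, s in self._owner.items() if s != server}

    def locate(self, chunk: bytes) -> str:
        # The first ring position clockwise from the chunk's hash owns it.
        i = bisect.bisect_right(self._points, _hash(chunk)) % len(self._points)
        return self._owner[self._points[i]]


def split_chunks(data: bytes, size: int = 4):
    """Fixed-size chunking, chosen here only for simplicity."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def place(ring, data: bytes, size: int = 4):
    """Return {server: set of unique chunks}; the set models local dedup."""
    placement = {}
    for chunk in split_chunks(data, size):
        placement.setdefault(ring.locate(chunk), set()).add(chunk)
    return placement


ring = ConsistentHashRing(["cs1", "cs2", "cs3"])
placement = place(ring, b"AAAABBBBAAAACCCC")
```

Because the target server is derived from the chunk's content, identical chunks always land on the same server, where a local deduplicating store can keep a single copy without any cross-server index lookup. When a server is removed from the ring, only the chunks it owned are reassigned, which is the property that makes consistent hashing attractive for failure recovery.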
